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Abstract. The 'classical' random graph models, in particular G{n,p), 
are 'homogeneous', in the sense that the degrees (for example) tend to 
be concentrated around a typical value. Many graphs arising in the real 
world do not have this property, having, for example, power-law degree 
distributions. Thus there has been a lot of recent interest in defining and 
studying 'inhomogeneous' random graph models. 

One of the most studied properties of these new models is their 'robust- 
ness', or, equivalently, the 'phase transition' as an edge density parameter 
is varied. For G(n,p), p = c/n, the phase transition at c = 1 has been a 
central topic in the study of random graphs for well over 40 years. 

Many of the new inhomogeneous models are rather complicated; al- 
though there are exceptions, in most cases precise questions such as deter- 
mining exactly the critical point of the phase transition are approachable 
only when there is independence between the edges. Fortunately, some 
models studied have this property already, and others can be approximated 
by models with independence. 

Here we introduce a very general model of an inhomogeneous random 
graph with (conditional) independence between the edges, which scales so 
that the number of edges is linear in the number of vertices. This scaling 
corresponds to the p — c/n scaling for G{n,p) used to study the phase 
transition; also, it seems to be a property of many large real-world graphs. 
Our model includes as special cases many models previously studied. 

We show that, under one very weak assumption (that the expected num- 
ber of edges is 'what it should be'), many properties of the model can be 
determined, in particular the critical point of the phase transition, and the 
size of the giant component above the transition. We do this by relating our 
random graphs to branching processes, which are much easier to analyze. 

We also consider other properties of the model, showing, for example, 
that when there is a giant component, it is 'stable'; for a typical random 
graph, no matter how we add or delete o(n) edges, the size of the giant 
component does not change by more than o(n). 
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1. Introduction 

The theory of random graphs was founded in the late 1950s and early 
1960s by Erdos and Renyi [HI EH]) who started the systematic study of the 
space Q{n,M) of all graphs with n labelled vertices and M = M{n) edges, 
with all graphs equiprobable. (Usually, one writes G{n, M) for a random 
element of Q{n,M).) At about the same time, Gilbert |51j introduced the 
closely related model G{n,p) of random graphs on n labelled vertices: a 
random G{n,p) £ G{n,p) is obtained by selecting edges independently, each 
with probability p = p{n). For many questions, such as those considered in 
this paper, the models are essentially equivalent (if p = M/{^), say). As 
Erdos and Renyi are the founders of the theory of random graphs, it is not 
surprising that both G{n,p) and G{n,M) are now known as Erdos-Renyi 
random graphs. 

In addition to these two 'classical' models, much attention has been paid 
to the space of random r-regular graphs, and to the space Q{k — out) of ran- 
dom directed graphs where each vertex has out-degree k, and the undirected 
graphs underlying these. All these random graph models are 'homogeneous' 
in the sense that all vertices are exactly equivalent in the definition of the 
model. Furthermore, in a typical realization, most vertices are in some 
sense similar to most others. For example, the vertex degrees in G{n,p) or 
in G{n, M) do not vary very much: their distribution is close to a Poisson 
distribution. 

In contrast, many large real-world graphs are highly inhomogeneous. One 
reason is that the vertices may have been 'born' at different times, with 
old and new vertices having very different properties. Experimentally, the 
spread of degrees is often very large. In particular, in many examples the 
degree distribution follows a power law. In the last few years, this has led 
to the introduction and analysis of many new random graph models de- 
signed to incorporate or explain these features. Recent work in this area 
perhaps started from the observations of Faloutsos, Faloutsos and Falout- 
sos |49| concerning scaling in real-life networks, in particular the power-law 
distribution of degrees in the 'internet graph', and similar observations con- 
cerning the 'web graph' made by Kleinberg, Kumar, Raghavan, Rajagopalan 
and Tomkins [HE]) and by Barabasi and Albert [Hj, who also looked at sev- 
eral other real world graphs. The latter two groups introduced two of the 
first models to explain these observations, using the ideas of 'copying' and 
of 'growth with preferential attachment', respectively. Observations of and 
proposed models for other networks followed, including protein interaction 
networks, telephone call graphs, scientific collaboration graphs and many 
others. Extensive surveys of the mostly experimental or heuristic work have 
been written by Barabasi and Albert |^ and Dorogovtsev and Mendes |42j . 

Some of the first rigorous mathematical results concerning (precisely de- 
fined variants of) these new models are those of Bollobas and Riordan , 
Bollobas, Riordan, Spencer and Tusnady Buckley and Osthus j^H] and 
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Cooper and Frieze HO]- For a partial survey of the rapidly growing body 
of rigorous work see Bollobas and Riordan [2^. Needless to say, surveys in 
an active area such as this quickly become dated, and there are already too 
many rigorous results in the field to list here. 

Perhaps the most striking and important result of Erdos and Renyi con- 
cerns the sudden emergence of the 'giant component', the phase transition 
in G{n,p) that occurs at p = 1/n: if c > is a constant, then the largest 
component of G{n, c/n) has order 0(log n) whp if c < 1, and order Q(n) whp 
if c > 1 (see Section |31 for the notation). In particular, a giant component 
of order B(n) exists (with high probability as n — > oo) if and only if c > 1. 
Over twenty years later, Bollobas 15 and Luczak (7Tj proved considerably 
sharper results about the exact nature of this phase transition: in particular, 
they determined the exact size of the 'window' in which the transition takes 
place. Further, very detailed results were proved by Janson, Knuth, Luczak 
and Pittel [SHI; Bollobas ^B] and Janson, Luczak and Rucihski |59j for 
numerous results and references. 

Our main purpose in this paper is to lay the foundations of a very general 
theory of inhomogeneous sparse random graphs. To this end 

• we shall define a general model that is sufficiently flexible to include 
exactly many of the specific spaces of inhomogeneous random graphs 
that have been studied in recent years, 

• we shall establish a close connection between the component struc- 
ture of a random graph in this model, the survival probability of a 
related branching process, and the norm of a certain operator, 

• we shall use these connections to study the phase transition in our 
model, examining especially the numbers of vertices and edges in the 
giant component, and 

• we shall prove results concerning the stability of the giant component 
under the addition and deletion of edges. 

In addition, we shall study various other properties of our model, including 
the degree distribution, the numbers of paths and cycles, and the typical 
distance between pairs of vertices in the giant component. Furthermore, we 
shall spell out what our results say about many specific models that have 
been studied previously. 

Although we shall give many examples throughout the paper, to motivate 
the definitions it may help to bear in mind one particular example of the 
general class of models we shall study. This example is the uniformly grown 
random graph, or c/j -graph, gII\c). Here c > is a parameter that will 

be kept constant as n varies, and the graph gI/\c) is the graph on [n] = 
{1,2, ... ,n} in which edges are present independently, and the probability 
that for i ^ j the edge ij is present is pij = min{c/max{i, j}, 1}, or simply 
c/j if i < J and c < 2. 
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A natural generalization of G{n,p) that includes the example above as 
a very special case is obtained by replacing the single parameter p by a 
symmetric n x n matrix (ptj) with < pij < 1. We write G{n, (pij)) for the 
random graph with vertex set [n] where i and j are connected by an edge 
with probability pij, and these events are independent for all pairs 
with i < j; see |^ p. 35]. We are interested in asymptotics as n ^ oo, 
usually with {pij) = (pjj(n)) depending on n. It seems difficult to obtain 
substantial asymptotic results for G{n,{pij)) without further restrictions; 
the model is too general. (However, for connectedness Alon [Jj proved a 
number of results.) 

Here we are mainly interested in random graphs where the average degree 
is 0(1); one of the main cases treated in this paper is pij = K{i/n,j/n)/n for 
a suitable function k on (0, 1]^. Taking K{x,y) = c/ max{x,y}, we obtain 

Gli\c). Many other graphs studied earlier by different authors can also be 
obtained by choosing K{x,y) suitably; see Sectional and the forthcoming 
papers |2U1 157j . A precise definition of the random graphs treated here will 
be given in Section [21 and some simple examples in Section^ 

The rest of the paper is organized as follows. In Section [21 we define 
the model G(n, k) we shall study, along with the branching process and 
integral operator to which we shall relate its component structure. 

In Subsection 13. II we present our main results on the giant component of 
G(n, k): under certain weak assumptions we obtain necessary and sufficient 
conditions for G{n, k) to have a giant component, show that when the giant 
component exists it is unique, and find its normalized size and number of 
edges. Further results are presented in the following subsections, on the 
'stability' of the giant component in Subsection 13.21 on small components 
in Subsection 13.31 on the degree sequence in Subsection 13.41 and on the 
typical distance between vertices of the giant component in Subsection 13.51 
In Subsection 13 . 61 we turn to the phase-transition in G{n, k); more precisely, 
we examine the growth rate of the giant component as it emerges. 

Since our model is very general, and the definition rather lengthy, special 
cases of the model play an important role in the paper; we have described 
one, G\!\c) already. In Section [11 we give several further simple examples, 
to illustrate the definitions and results of the previous sections. Towards the 
end of the paper, we shall discuss several other special cases more extensively, 
in particular describing the relationship to other models studied earlier; we 
consider these to be applications rather than illustrations of the model, and 
so present them after the proofs. 

The next several sections are devoted to the proofs of the main results; 
the reader interested primarily in the applications may wish to skip straight 
to Section 1161 We start by analyzing the branching process jC^ , proving 
results about this process that will help us relate G(n, k) to X^. The study 
of itself is not one of our main aims. In Section [3 we prove various 
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lemmas needed in Section El to prove the results about that we shall use 
throughout the paper. 

Next, we turn to preparatory results concerning G{n, k) itself, starting 
with simple approximation lemmas in Sectional basic results on the number 
of edges of G{n, k) are given in Section |H1 

Our main results about the existence and size of the giant component are 
proved in Section 13 using material from the previous sections; the reader 
who is interested only in the derivations of these results should read Sections 
121 to El The number of edges in the giant component is determined in 
Section ^1 

Sections^JtoElare devoted to the proofs of the results in Subsections 13.21 
to l3.6l broadly speaking, these proofs rely on the results up to Section j^l but 
not on each other, so the reader may safely omit any subset of these sections. 
The stability result is proved in Section ITTl the results on small components 
in Section ^1 the vertex degrees are studied in Section ^1 the distance 
between vertices in Section ITU and the phase transition in Section IT^ The 
latter results may be viewed purely as statements about a branching process, 
in which case the proofs need only the results of Sections [3 and 

In Section El we apply our general model to deduce results about several 
specific models, in particular ones that have been studied in recent years, 
and discuss the relationship of our results to earlier work. 

In Section El we give some simple results about paths and cycles in 
G(n, k), inspired by the work of Turova on a special case of the model 
(described in Section [TH|) . and show that a conjecture of hers holds under 
mild conditions. In Section El we discuss several (at least superficially) re- 
lated models as well as possible future work. Finally, in the appendix we 
give some basic results on random measures used throughout the paper. 



2. The model 

In this section we define the random graph model that we shall study 
throughout the paper, as well as a branching process and an integral op- 
erator that will be key to characterizing the component structure of this 
random graph. This section also includes various remarks on the defini- 
tions, including descriptions of several minor variants; the formal definitions 
may be understood without reference to these remarks. To make sense of the 
definitions, the reader may wish to keep in mind the model G\!\c) defined 
in the introduction. 

Our model is an extension of one defined by Soderberg |HH1- Let 5 be a 
separable metric space equipped with a Borel probability measure ^. We 
shall often suppress the measure /i in our notation, writing, for example, 
II lip for the norm in LP{S) = LP{S, fi), and 'a.e. on 5' for n-a.e. on S. Much 

of the time (for example, when studying gI!\c)), we shall take S = (0, 1] 
with /i Lebesgue measure. Throughout the paper, the 'kernel' /t will be a 
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symmetric non-negative function on 5 x 5. Further conditions on k will be 
given in Definitions 12.71 and 12.101 

For each n we have a deterministic or random sequence x„ = (xi, . . . , Xn) 

of points in S. Formally, we should write = {x^^\ . . . ,x^n^), say, as we 
assume no relationship between the ith elements of x„ and of x„'. However, 
this notation would be rather cumbersome, and it will always be clear which 
x„ an Xi is an element of. Writing 6x for the measure consisting of a point 
mass of weight 1 at x, and 



for the empirical distribution of x^, we shall assume that f„ converges in 
probability to /i as n ^ oo, with convergence in the usual space of probability 
measures on S (see, e.g., |13j). This condition has a simple down-to-earth 
description in terms of the number of Xi in certain sets A: a set A C 5 is a 
IJ,- continuity set if A is (Borel) measurable and fJ-idA) = 0, where dA is the 
boundary of A. The convergence condition Vn ^ means exactly that for 
every /i-continuity set A, 



see Appendix^for technical details. 

One example where 1)2. 2|) holds is the random case, where the Xi are 
independent and uniformly distributed on S with distribution /u (as in 
Soderberg jHH]); then (|2.2|) holds by the law of large numbers. 

We shall often consider S = (0, 1] with the Lebesgue measure ^; in this 
case, condition (|2.2|) has to be verified only for intervals (see Remark IA.3|) . 
For this pair (5,//) we shall have two standard choices for the (xj): the de- 
terministic case Xi = i/n, and the random case where the Xj are independent 
and uniformly distributed on (0, 1]. To express G]!\c) as a special case of 
our model, we shall take Xi = i/n. 

For later formal statements, we gather the preceding assumptions into 
the following definitions. 

Definition. A ground space is a pair (5,/x), where 5 is a separable metric 
space and /U is a Borel probability measure on S. 

Definition. A vertex space V is a triple (5, /i, (x„)„>i), where (5,/i) is a 
ground space and, for each n > 1, x„ is a random sequence (xi, X2, . . . , x„) 
of n points of 5, such that (|2.2j) holds. 

Of course, we do not need (x„)„>i to be defined for every n, but only for 
an infinite set of integers n. 

Definition. A kernel k on a ground space (5, /i) is a symmetric non- 
negative (Borel) measurable function on 5 x 5. By a kernel on a vertex 
space {S,ji, (x„)„>i) we mean a kernel on {S,n). 




(2.1) 



1=1 



Vn{A) := #{i : Xi e A}/n ^ ^i{A); 



(2.2) 
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From now on, unless otherwise stated, we shall always write a vertex 
space V as (x„)n>i), and x„ as {xi,X2, ■ ■ ■ ,Xn)- As noted above, the 

(distributions of) the individual xi depend on n; in the notation we suppress 
this dependence as it will always be clear which member of. 

Let K be a kernel on the vertex space V. Given the (random) sequence 
(xi, . . . , Xn), we let G^{n, k) be the random graph G^{n, (pij)) with 

Pij := mm{K{xi,Xj)/n,l]. (2.3) 

In other words, G'^(n, k) has n vertices {1, . . . , n} and, given xi, . . . , 
an edge ij (with i ^ j) exists with probability pij, independently of all 
other (unordered) pairs ij. Often, we shall suppress the dependence on V, 
writing G{n, k) for G^(n, k). We have described one example already: if we 
take k(x, y) = c/ max{x, y], with 5 = (0, 1], /i Lebesgue measure, and xi = 
Xj-"^ = i/n, then ()2.3() gives = min{c/ max{i, j}, 1}, so G^(n, k) is exactly 

the uniformly grown random graph G\I\c) described in the introduction. 
We shall discuss several other examples in Sections El and ITHl 

Remark 2.1. The random graph G(n, K)=G^(n, k) depends not only on k 
but also on the choice of xi, . . . , Xn- Much of the time, our notation will not 
indicate how the points chosen, since this choice is irrelevant for our 

results as long as (|2.2|) holds and certain pathologies are excluded (see (|2.9j) , 
Lemma 18.11 and Example 18.6)1 . The freedom of choice of xi, . . . , x,i gives 
our model flexibility, as shown by Proposition 19.31 Theorem 112.11 and the 
examples in Sections and El but does not affect the asymptotic behaviour. 
Of course, this asymptotic behaviour does depend very much on S and ^. 

In order to make our results easy to apply, it will be convenient to extend 
the definitions above in two ways, by allowing ijl{S) to take any value in 
(0, 00), and by allowing the number of vertices of G^{n, n) to be random, 
rather than exactly n. As we shall see later, this makes no essential dif- 
ference, and we shall almost always work with the n vertex model in our 
arguments. We shall consider the 'generalized' model only for the conve- 
nience of a reader wishing to apply the results in the next section, obviating 
the need for a separate reduction to the n vertex model in each case. All 
other readers may safely ignore the 'generalized' model, including the formal 
definitions that we now state. 

Definition. A generalized ground space is a pair {S,n), where 5 is a sepa- 
rable metric space and /x is a Borel measure on S with < /x(5) < 00. 

Let I C (0, 00) be any unbounded set, the index set parametrizing our 
model. Usually, / is the positive integers, or the positive reals. For com- 
patibility with our earlier definitions, we write n for an element of I, even 
though this need not be an integer. 

Definition. A generalized vertex space V is a triple (5, /x, (x„)„g/), where 
(5, fj,) is a generalized ground space and, for each n G /, x„ is a random 
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sequence (xi, X2, • • • , of points of S of random length Vn > 0, such that 
(|2.2jl holds, i.e., such that 

i^niA) := #{i : Xi G A}/n ^ ii{A) (2.4) 

as n G / tends to infinity for every /x-continuity set A] equivalently, Vn ^ fi. 

The definition of a kernel /t on a generalized ground space is exactly as 
before. Finally, given a kernel k on a generalized vertex space, for n S 
I we let G^{n,K) be the random graph on {1,2, . . . ,Vn} in which, given 
Xn = {xi, . . . each possible edge ij, 1 < i < j < Vn, is present with 

probability 

Pij ■.= mm{K{xi,Xj)/n,l}, (2.5) 

and the events that different edges are present are independent. 

Note that if V is a generalized ground space, then, applying (|2.4j) with 
A = S, we see that G^(n,K) has /u(5)n + Op{n) vertices. In both 1)2. 4(1 
and ()2.5() we divide by n, rather than by the actual number of vertices, or 
by fi{S)n; this turns out to be most convenient normalization. Roughly 
speaking, by conditioning on the sequences (x„), or by adding Op{n) iso- 
lated vertices, we may assume without loss of generality that the number of 
vertices is deterministic. Furthermore, multiplying k and the index variable 
n by some constant factor, and dividing n{S) by the same factor, leaves the 
edge probabilities pij unchanged. As the condition ()2.4|) is also unaffected 
by this transformation, the only effect on the model is to rescale the param- 
eter n, and we may assume without loss of generality that we have a vertex 
space rather than a generalized vertex space; see Subsection 18.11 

Remark 2.2. We regard our random graphs as indexed by n, and consider 
what happens as n — > oo. This is for notational convenience only; we could 
consider graphs indexed by some other (possibly continuous) parameter, m, 
say, such that the number of vertices Vm of the graph with parameter m 
tends to infinity. This superficial modification is covered by the definitions 
above: instead of considering graphs on 2" vertices, say, one can always 
consider graphs on n vertices with n restricted to an 'index set' / consisting 
of the powers of 2. 

The generalized vertex space setting also allows the number of vertices to 
be random. In other words, we may let {xj} be a point process on S, for 
example, a Poisson process of intensity n; see Examples 14.91 and 14.111 and 
Subsection 116.51 

Remark 2.3. Changing k on a set of measure zero may have a significant 
effect on the graph G^(n,fi;); see Example I8.f)| for instance. Indeed, if the 
Xi are deterministic, then G^{n,K) depends only on the values of k on a 
discrete set. This means that in our proofs we cannot just ignore measure 
zero sets in the usual way. Later we shall impose very weak conditions to 
control such effects; see Remark 12.81 
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Before turning to the key definitions, giving conditions under which we 
can prove substantial results about G^(n, k), let us make some remarks 
about some minor variants of the model. 

Remark 2.4. As an alternative to (|2..S|) (or (|2.5j) ). we could use k to define 
the intensities of Poisson processes of edges, and ignore multiple edges, so 
the probability pij that there is an edge between i and j would be given by 

Pij := 1 - exp{-K{xi,Xj)/n), (2.6) 

rather than by 1)2. The results below are valid for this version too; this 
can be shown either by checking that all arguments hold with only trivial 
changes, or by defining Kn{x,y) := n(l — exp(— k(x, y)/n)) and using the 
setting in Definition 12.91 

Another alternative, studied by Britton, Deijfen and Martin-L6f in a 
special case (see Subsection 116. 4p . is to let Pij/{1 — Pij) = ^{xi, Xj)/n, i.e., 
to take 

Pij := K{xi,Xj)/{n + K.{xi,Xj)). (2.7) 

Again, the results below remain valid; we now define Kn{x, y) := k{x, y)/{l + 
K{x,y)/n). 

Remark 2.5. In this paper we treat only simple graphs. One natural 
variation that yields a multigraph is to let the number of edges between 
i and j have a Poisson distribution with mean K{xi,Xj)/n. Under suitable 
conditions (e.g. that k is bounded), it is easy to see that whp there are no 
triple edges, and that the number of double edges is Op(l); more precisely, 
it has an asymptotic Poisson distribution with mean j ff k^, see Section [T71 
The underlying simple graph is just the graph defined in Remark 12.41 

Another variation (which can be combined with the previous one) is to 
permit loops by allowing i = j in the definition above. These variations do 
not affect our results on component sizes. 

Remark 2.6. Our model can be extended to a random graph process on 
a fixed vertex set describing an inhomogeneously growing random graph: 
Start without any edges and, given x„, add edges at random times given by 
independent Poisson processes with intensities K{xi,Xj)/n. (Ignore multiple 
edges.) At time t, we obtain the version of the random graph G{n, tn) given 
by (|2.6|) : cf. Remark 12.41 Alternatively, we may add edges sequentially, with 
each new edge chosen at random with probabilities proportional to 
this gives the same process except for a (random) change of time scale. 

Without further restrictions, the model G^{n,n) we have defined is too 
general for us to prove meaningful results. Indeed, the entire graph may be 
determined by the behaviour of k on a measure zero set. Usually, k is contin- 
uous, so this problem does not arise. However, there are natural examples 
with K discontinuous, so we shall assume that k is continuous a.e. rather 
than everywhere. With this weaker condition, to relate the behaviour of 
G^(n, k) to that of k we shall need some extra assumptions. The behaviour 



THE PHASE TRANSITION IN INHOMOGENEOUS RANDOM GRAPHS 



11 



of the total number of edges turns out to be the key to the ehmination of 
pathologies. 

As usual, we write e(G) for the number of edges in a graph G. Note that 
Ee(G'(n, {pij))) = Y.i<jPij^ so we have 

Ee{G^{n,K)) =E'^mm{K{xi,Xj)/n,l}. (2.8) 

i<j 

In well behaved cases, this expectation is asymptotically ff see, for 
example, Lemma l8. II 

Definition 2.7. A kernel k, is graphical on a (generalized) vertex space 
V = {S,fi, (x„)) if the following conditions hold: 

(i) K is continuous a.e. on 5 x 5; 

(ii) K e L^{S X S, n X fi); 
(iii) 



iEe(G^(n,K))-i|^ 



K{x,y)dfi{x)dn{y). 



(2.9) 



Note that whether k is graphical on V depends on the sequences x„. Also, 
as we shall see in Remark 18.41 if n is graphical on V, then so is ck for any 
constant c > 0. (This statement would be trivial without the min{-, 1} 
operation in the right-hand side of (|2.8j) . With this, it is still not hard to 
check.) 



Remark 2.8. Conditions (i) and (ii) are natural technical conditions; at 
first sight, condition (iii) is perhaps unexpected. As we shall see, some extra 
condition is needed to exclude various pathologies; see Example 18.61 for 
example. Condition (iii) is in fact extremely weak: the natural interpretation 
of K, is that it measures the density of edges, so the integral should be 



the expected number of edges, suitably normalized. Thus condition (iii) 
says that G^(n, k) has about the right number of edges, so if (iii) does not 
hold, K has failed to capture even the most basic property of the graph. 
What is surprising, is that this condition is enough: we shall show that 
the assumptions above are enough for n to capture many properties of the 
graph. 

In fact, in many circumstances, condition (iii) is automatically satisfied. 
Indeed, one of the two inequalities implicit in this definition, namely 

liminf iEe(G^(n,K)) 



> 



n 



52 



K{x,y)dfi{x)dfi{y), 



111 



holds when- 



always holds; see Lemma l8. II This lemma also shows that 
ever k is bounded and V is a vertex space. It also holds whenever V is 
a vertex space in which the Xi are (pairwise) independent and distributed 
according to fi. Moreover, condition (iii) is likely to hold, and to be easy 
to verify, for any particular model that is of interest. Proposition 18.91 shows 
that when (iii) does hold, so the normalized number of edges converges in 
expectation, then it also converges in probability. Note also that (|2.9j) holds 
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if and only if the corresponding relation holds for the variants of G^(n,K) 
discussed in Eemark \2A\ see R,emark 18.41 



In conjunction with condition |(iii)[ condition (ii) says that the expected 
number of edges is 0(n), so the (expected) average degree is 0(1). There 
are interesting cases with more edges, but they will not be treated here; cf. 
Section [H 



We can be somewhat more general and allow minor deviations in 1)2. 3() 
by letting k depend on n. This will ensure that our results apply directly 
to the various variations on the model discussed above. The conditions we 
shall need on a sequence of kernels are contained in the next definition. 

Definition 2.9. Let V = (5,/i, (x„)) be a (generalized) vertex space and 
let K be a kernel on V. A sequence (k„) of kernels on (5, fi) is graphical on 
V with limit k if, for a.e. (y, z) G 5^, 

Vn^y and z imply that Knivn, Zn) ^ K{y,z), (2.10) 



K satisfies conditions (i) and (ii) of Definition 12.71 and 

-EeiG^in,Kn))^l [[ ac(x, y) d/i(a:) d^(y). (2.11) 

Note that if k is a graphical kernel on V, then the sequence k„ with Kn = n 
for every n is a graphical sequence on V with limit k. Much of the time, 
members of a graphical sequence of kernels on V are themselves graphical 
on V. 

Much of the time, the conditions in Definition 12 . 91 will be all we shall need 
to prove results about G^(n, «;„). However, when we come to the size of the 
giant component, one additional condition will be needed. 

Definition 2.10. A kernel k on a (generalized) ground space (5,//) is re- 
ducible if 

3Ac S with < fi{A) < /x(5) such that k = a.e. on A x (5 \ A); 

otherwise n is irreducible. Thus k is irreducible if 

ACS and k = a.e. on A x {S \ A) implies ^i{A) = or /i(5 \ ^) = 0. 

(2.12) 

Roughly speaking, k is reducible if the vertex set of G^(n, k) can be split 
into two parts so that the probability of an edge from one part to the other 
is zero, and irreducible otherwise. For technical reasons, we consider a slight 
weakening of irreducibility. 

Definition 2.11. A kernel k on a (generalized) ground space (5, fx) is quasi- 
irreducible if there is a /i-continuity set S' C S with fJ,{S') > such that the 
restriction of n to S' x S' is irreducible, and k,{x, y) = if x ^ 5' or y ^ S' . 
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Remark 2.12. Given a quasi-irreducible kernel k and the associated graph 
Gn = G^{n, k), we can consider the irreducible restriction k' of k to S' x S' , 
and the corresponding graph G'^ = G^ (n, k') obtained from Gn by deleting 
the vertices with types in S \ S'; these vertices are isolated in G„. This 
graph is an instance of our model with a generalized vertex space V'; note 
that the number N' of vertices of G'^ is random. Thus, we may reduce 
suitable questions about quasi-irreducible kernels to the irreducible case. In 
our main results, the reader will lose nothing by reading irreducible instead 
of quasi- irreducible. We state some of the results for the quasi- irreducible 
case because this is all we need in the proofs (even without removing isolated 
vertices as above), and we sometimes need the quasi- irreducible case of one 
result to prove the irreducible case of another. 

2.1. A branching process. Let k be a kernel on a (generalized) ground 
space (5, /x). To study the component structure of G{n, k), we shall use the 
multi-type Galton-Watson branching process with type space S, where a 
particle of type x G 5 is replaced in the next generation by a set of particles 
distributed as a Poisson process on S with intensity k{x, y) dfj,{y). (Thus, the 
number of children with types in a subset A Q S has a Poisson distribution 
with mean k{x, y) d^{y), and these numbers are independent for disjoint 
sets A and for different particles; see, e.g., Kallenberg 61.) We denote this 
branching process, started with a single particle of type x, by X^^x). When 
/i(5) = 1, so ;U is a probability measure, we write for the same process 
with the type of the initial particle random, distributed according to /i. 

Let pk{K\x) be the probability that the branching process ?ii^{x) has a 
total population of exactly k particles, and let p>jt(K;x) be the probability 
that the total population is at least k. Furthermore, let p{k] x) be the proba- 
bility that the branching process survives for eternity. If the probability that 
a particle has infinitely many children is 0, then p{k; x) is equal to Poo{k] x), 
the probability that the total population is infinite; see Remark 15.21 

Set 

Pkii^) ■= / Pkii^;x)dp.{x), p{n) ■■= / p{n;x)dp{x), (2.13) 
Js Js 

and define p>k{i^) analogously. Thus, if p-{S) = 1, then p{k) is the survival 

probability of the branching process X^- Note that multiplying k by a 

constant factor c and dividing p by the same factor leaves the branching 

process Xk(x), and hence p{k;x) and pk{K;x), unchanged. However, p{k), 

for example, is decreased by a factor of c. 

Remark 2.13. As we shall see later, the branching process Xk(x) arises nat- 
urally when exploring a component of G{n, k) starting at a vertex of type x; 
this is directly analogous to the use of the single-type Poisson branching 
process in the analysis of the Erdos-Renyi graph G(n, c/n). In models with 
a fixed degree sequence, a related 'size-biased' branching process arises, as 
it matters how we reach a vertex. Here, due to the independence between 
edges, there is no size-biasing. 
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2.2. An integral operator. Given a kernel k on a (generalized) ground 
space {S,fi), let be the integral operator on {S,n) with kernel k, defined 
by 

{TJ){x) = k{x, y)f{y) dfi{y), (2.14) 

for any (measurable) function / such that this integral is defined (finite or 
+cxd) for a.e. x. As usual, we need never consider non-measurable functions; 
in future, we shall assume without comment that all functions considered are 
measurable. Note that T^/ is defined for every / > 0, with < T^/ < oo. 
If K G L^(5 X 5), as we shall assume throughout, then T^/ is also defined 
for every bounded /; in this case T^/ G L^i^) and thus T^/ is finite a.e. 
We define 

||r,|| := sup{||r«/||2 : / > 0, II/II2 < 1} < 00. (2.15) 

When finite, \\T^\\ is the norm of as an operator in L'^{S, fi); it is infinite 
if does not define a bounded operator in L^. Trivially, \\T^\\ is at most 
the Hilbert-Schmidt norm of T^: 

\\T^\\<\\T^\\hs -=11^111^3x8) = (^JJ'^^fiix,yfdfi{x)dti{y)^ . (2.16) 

We also define the non-linear operator <I>k by 

:= 1 - e-^^f (2.17) 

for / > 0. Note that for such / we have < T^/ < 00, and thus < < 
1. We shall characterize the survival probability p{k;x), and thus p{k), in 
terms of the non- linear operator showing essentially that the function 
X ^ p{k] x) is the maximal fixed point of the non-linear operator see 
Theorem \U.2[ 



3. Main results 

In this section we present our main results describing various properties 
of the general model G^{n, k„); some further general results will be given in 
the later sections devoted to individual properties. In Section we shall 
present results for special cases of the model, including several that have 
been studied previously. 

All our results are asymptotic, and all unspecified limits are taken as 
n — > 00. We use the following standard notation: for (deterministic) func- 
tions / = f{n) and g = g{n), we write / = 0{g) if f /g is bounded, / = 0,{g) 
if f /g is bounded away from zero, i.e., ii g = 0{f), and / = Q{g) if / = 0{g) 
and g = 0{f). We write / = o{g) if f /g 0. 

Turning to sequences of events and random variables, we say that an event 
holds with high probability (whp), if it holds with probability tending to 1 
as n 00. (Formally, it is a sequence En of events that may hold whp, but 
the n is often suppressed in the notation.) We write ^ for convergence in 
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probability. Thus, for example, if a G M, then X„ ^ a if and only if, for 
every e > 0, the relations Xn > a — e and < a + e hold whp. 

We shall use Op, Op and Qp in the standard way (see e.g. Janson, Luczak 
and Rucihski for example, if (Xn) is a sequence of random vari- 

ables, then Xn = Op{l) means "X„ is bounded in probability" and X^ = 

Op{l) means that Xn 0. Given a function f{n) > 0, we shall write 
Xn = 0{f{n)) whp if there exists a constant C < oo such that < 
Cf{n) whp. (This is written Xn = Oc{f{n)) in (SSI.) Note that this 
is stronger than Xn = Op{f{n)); the two statements can be written as 
3CVelimsup„P(|X„| > C/(n)) < e and Ve3Climsup„ P(|X„| > Cf{n)) < 
e, respectively. We shall use Xn = 0(/(n)) whp similarly. 

We denote the orders of the components of a graph G by Ci(G) > 
C2{G) > with Cj{G) = if G has fewer than j components. We 
let Ni^{G) denote the total number of vertices in components of order k, 
and write N>k{G) for ^j>i^Nj{G), the number of vertices in components 
of order at least k. 

We shall write a Ab and a V 6 for min{a, b} and max{a, b}, and use the 
same notation for the pointwise minimum or maximum of two functions. 

As noted in the previous section, a reader who wishes to understand the 
following results, rather than apply them to a specific model, may safely 
ignore all references to generalized vertex spaces. 

3.1. Existence, size and uniqueness of the giant component. Our 

first result gives a necessary and sufficient condition for the existence of a 
giant component in our model. 

Theorem 3.1. Let (k„) be a graphical sequence of kernels on a (generalized) 
vertex space V with limit k. 

(i) // \\Tk\\ < 1, then Ci(G^(n,K„)) = Op{n), while if \\T^\\ > 1, then 
Ci{G^in,Kn)) = e(n) whp. 

(ii) For any e > 0, whp we have 

-Ci{G^{n,Kn)) <p{K)+e. (3.1) 

(iii) // K is quasi-irreducible, then 

-Ci(G^(n,K„))^/5(K). (3.2) 
n 

In all cases p{k) < /i(5); furthermore, p{k) > if and only if \\Ti^\\ > 1. 

This result will be proved in Section 121 along with an additional result. 
Theorem 19. 101 concerning the distribution of the types of the vertices in the 
giant component. We have included the final statement about p{k) for ease 
of future reference, even though it is purely a statement about the branching 
process X^- As remarked above, p{k) can be found from the solutions of the 
non- linear equation / = ^^{f); see Theorem 16.21 
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Theorem 13 . 1 1 has several immediate consequences. As customary, we say 
that a sequence of random graphs Gn (with G(n) vertices in Gn) has a giant 
component (whp) if Ci(G„) = 0(n) whp. For simphcity we state these 
results in the form where the kernel «;„ is independent of n. 

Corollary 3.2. Let n he a graphical kernel on a (generalized) vertex space V , 
and consider the random graphs G^{n,cK) where c> is a constant. Then 
the threshold for the existence of a giant component is c = \\Tf^\\~^ . More 
precisely, if c < ||Tk||~^, then Ci(G^(n,cK)) = Op(n), while if c > ||Tk||~^ 
and K is irreducible, then Ci(G^(n, ck)) = p{cK)n + 0p{n) = Qp{n). 

Corollary 3.3. Let k be a graphical kernel on a (generalized) vertex space 
V. Then the property that ck) has whp a giant component holds for 

every c > if and only if \\Tf^\\ = oo. Otherwise it has a finite threshold 
Co > 0. 

The corollaries above are immediate from Theorem the observation 
that \\Tck\\ = c||Tk||, and the fact that k graphical on V implies cn graphical 
on V (see Remark \HA^ . In the light of the results above, we say that a 
kernel k is subcritical if \\Ti^\\ < 1, critical if ||Tk|| = 1, and supercritical if 
||Tk|| > 1. We use the same expressions for a random graph G{n, k) and a 
branching process jC^. 

The next result shows that the number of edges in the graph at the 
point where the giant component emerges is maximal in the classical Erdos- 
Renyi case, or the slightly more general 'homogeneous case' described in 
Example 14. 6t see SectionElfor the proof. (In this result we do need /i(5) = 1 
as a normalization.) 

Proposition 3.4. Let k„ be a graphical sequence of kernels on a vertex 
space V with limit k, and assume that k is critical, i.e. ||Tk|| = 1. Then 
^e{G^ {n, Kn)) ^ ^ ff K < 1/2, with equality in the uniform case k = 1; 
more precisely, equality holds if and only if k{x, y) d^{y) = 1 for a.e. x. 

We can also determine the asymptotic number of edges in the giant com- 
ponent. As this is not always uniquely defined, for any graph G, let Ci(G) be 
the largest component of G, i.e., the component with most vertices, chosen 
according to any fixed rule if there is a tie. In order to state the next result 
concisely, let 

Cii^) ■= ^ jj^^ K-ix, y) {p{k; x) + p{k; y) - p{k; x)p{k; y)) dp{x) dp{y). (3.3) 

Note that the bracket above is the probability that, given independent 
branching processes X^ix) and 3iK{y), at least one survives. Intuitively, 
given that a certain edge is present in G^{n,Kn), this edge is in the giant 
component if, when exploring the rest of the graph from its end- vertices, 
there is at least one from which we can reach many vertices. 
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Theorem 3.5. Let be a graphical sequence of kernels on a (generalized) 
vertex space V with quasi-irreducible limit k. Then 

-e(Ci(G^(n,K„))) i^Cl'^). (3.4) 

This resuh will be proved in Section [TUl together with some properties of 
C(k). 

Under our assumptions, the giant component is whp unique when it exists; 
the second largest component is much smaller. Indeed, as we shall show 
in Section!^ only Op(re) vertices are in 'large' components other than the 
largest. 

Theorem 3.6. Let be a graphical sequence of kernels on a (generalized) 
vertex space V with quasi-irreducible limit k, and let Gn = {n, Kn) ■ If 
Lv{n) oo and uj{n) = o{n), then 

Cj{Gn) = Oj,{n). (3.5) 

j>2: Cj(Gn)>uj{n) 

In particular, 

C2(G„) = 0p{n). (3.6) 

Remark 3.7. If k is reducible and the (absolutely) continuous ran- 

dom variables, then G^{n,K) decomposes into two (or more) disjoint parts 
that can be regarded as G^^{ni, k,), for suitable Vj, nj and Kj. By consider- 
ing the parts separately, many of our results for the irreducible case can be 
extended to the reducible case; note, however, that each of the parts may 
contain a giant component, so it is possible to have C2 = G)p{n). The re- 
striction to the case where the Xi are continuous, which includes the Poisson 
case of Example 14. 9( is necessary unless we impose a further restriction on 
k: in general there may be a subset A <Z S oi measure zero which always 
contains some Xj, and this can link the subgraphs G^^{ni, Ki). Worse still, 
such an A may contain an Xi with probability bounded away from and 1, 
so ^Ci{G^ {n, k)) need not converge in probability. 

Remark 3.8. If k and k' are two kernels on the same vertex space with 
K < k', then G^(n, k) and G^{n, k') are random graphs on the same vertex 
set, and there is a natural coupling between them in which G^{n, k) is 
always a subgraph of G^(n, k'), i.e., a coupling with G{n,K) Q G{n,K'). 
Similarly, one can couple the corresponding branching processes so that 
every particle present in one is present in the other, i.e., so that Q X^'- 
Thus p{k) < p{k!). If K is irreducible and p{k) > 0, then it follows from 
Theorem 16 . 21 and Lemma l5 . 1 21 that p(k') > p(k) unless e. Similarly, 

the threshold co(k') := ||Tk'||~^ is at most co(k) := ||Tk||^"'^. Here, however, 
somewhat surprisingly, we may have co(k') = co(k) even if k' > n; see 
Subsection 116.31 On the other hand, it is easily seen that if T^j is compact, 
K is irreducible, and k' > k on a set of positive measure, then HT^'H > \\Ti^\\ 
and thus Co(k') < co(k). 
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3.2. Stability. The giant component of Gn = G^{n,Kn) is stable in the 
sense that its size does not change much if we add or delete a few edges; 
this is made precise in the following theorem. Note that the edges added 
or deleted do not have to be random or independent of the existing graph; 
they can be chosen by an adversary after inspecting the whole of Gn- Also, 
we may delete vertices instead of (or as well as) edges. 

Theorem 3.9. Let he a graphical sequence of kernels on a (generalized) 
vertex space V with irreducible limit k, and let Gn = G^{n,Hin)- For every 
e > there is a 6 > (depending on k) such that, whp, 

{p{k) - e)n < CiG'J < + e)n (3.7) 

for every graph G'n that may he obtained from Gn by deleting at most 6n 
vertices and their incident edges, and then adding or deleting at most 6n 
edges. 

In particular, if G'n is a graph on V{Gn) with e(G^ A Gn) = Op{n) then 

Gi{G'J = Ci{Gn) + Op{n) = p{K)n + Op{n). 

Theorem 13.91 is proved in Section 1111 Clearly, in proving the first inequality 
in (|3.7j) , we may assume that G'n C Gn , and in proving the second that Gn C 
G'n- The latter case will be easy to deal with using Theorem l3.6l Proving the 
first inequality amounts to showing that, whp, the giant component of Gn 
cannot be cut into two pieces of size at least 0(n) by deleting o(n) vertices 
and then o(n) edges. For edge deletion, Luczak and McDiarmid |20| gave 
a very simple proof of this result in the Erdos-Renyi case, which adapts 
easily to the finite type case and hence (using our general results) to the full 
generality of Theorem 13.91 This proof is presented in Section 1111 

Another approach to proving Theorem 13.91 involves reducing this state- 
ment to an equivalent statement about the two-core, which is very easy to 
prove in the uniform case. This reduction involves relating the two-core to 
the branching process, using results that we believe are of interest in their 
own right, presented in Section [TTl Unfortunately, while the general case of 
the two-core result can be proved by branching process methods, the proof 
is very complicated, so we shall not give it. 

Remark 3.10. Theorem 13.91 mav be viewed as a statement about the vul- 
nerability of large-scale networks to attack by an adversary who knows the 
detailed structure of the network, and attempts to disconnect the network 
into small pieces by deleting a small fraction of the vertices or edges. The 
vulnerability of 'scale-free' networks to such attacks has been considered by 
many people; see, for example, |^ EHl IS] ; it turns out that such net- 
works are much more resilient to random failures than G{n,c/n), but also 
more vulnerable to attack. In general, the flexibility available to the attacker 
makes rigorous analysis difficult, although a result for the Barabasi- Albert 
model was given in |24j . Theorem 13.91 shows in particular that, for G^(n, k). 
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the network is at most a constant factor more vulnerable than a homoge- 
neous network: a constant fraction of the vertices or edges must be deleted 
to destroy (or significantly shrink) the giant component. 

Remark 3.11. As pointed out by Britton and Martin-Lof [28^, in the case of 
vertex deletion Theorem 18.91 also has the following interpretation: suppose 
that Gn represents the network of contacts that may allow the spread of 
an infectious disease from person to person, and that we wish to eliminate 
the possibility of an epidemic by vaccinating some of the population. Even 
if the entire network of contacts is known, if the source of the infection is 
not known, a significant (constant, as n — > co) proportion of the population 
must be vaccinated: otherwise, there is still a giant component in the graph 
on the unvaccinated people, and if the infection starts at one of its vertices, 
it spreads to 0(n) people. 

3.3. Bounds on the small components. For the classical random graph 
G(n, c/n) it is well-known that in the subcritical (c < 1) case, Ci = O(logn) 
whp, and that in the supercritical (c > 1) case, C2 = O(logn) whp; see 
jl6l I59j . for example. These bounds do not always hold in the general 
framework we are considering here, but if we add some conditions, then we 
can improve the estimates Op(n) in Theorem 13. II and 1)3. 6|) to O(logn) whp. 
As before, we write G„ for G^(n, k„). 

Theorem 3.12. Let («;„) he a graphical sequence of kernels on a (general- 
ized) vertex space V with limit k. 

(i) // K is subcritical, i.e., ||Tk|| < 1, and sup^ y ^ Kn{x,y) < 00, then 
Ci{Gn) = 0{logn) whp. 

(ii) If K is supercritical, i.e., \\T^\\ > 1, k. is irreducible, and either 

,y,n l^n{x, y) >0 or sup^ y j^ Kjj(a;, y) < 00, thenC2iGn) — O (log 71) 
whp. 

Theorem 13.121 is proved in Section 1121 Note that in part (ii) we draw the 
same conclusion from the very different assumptions \nix,y,ni^n{x,y) > 
and sup^ „ k„(x, y) < 00. There is no similar result for the subcritical case 
(part (i)) assuming only that mfx^y^nKn{x,y) > 0: ^1 Theorems 1 and 2] 
show that the random graph gII\c) with 0<c<l/4is subcritical and 
satisfies Ci{G]I\c)) = n®(i) whp. 

3.4. Degree sequence. We next turn to the degrees of the vertices of G„ = 
G^(n, Kn), where «;„ k. As we shall see, the degree of a vertex of a given 
type X is asymptotically Poisson with a mean 

\{x) := / K{x,y)dfi{y) (3.8) 
Js 

that depends on x. This leads to a mixed Poisson distribution for the degree 
of a (uniformly chosen) random vertex of G„ . We write for the number 
of vertices of Gn with degree k. 
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Theorem 3.13. Let he a graphical sequence of kernels on a (general- 
ized) vertex space V with limit k, and let Gn = G^^ {n, Kn)- For any fixed 
k>0, 

J s 

where X{x) is defined by ()3.8() . Equivalently, 

Zk/\ViGn)\^m = k), 

where H has the mixed Poisson distribution J^Po{X(x)) dfi{x) / fi{S) . 

In other words, if D is the degree of a random vertex of G„ = G^{n, Kn), 
and we normalize so that fJ-{S) = 1, then 

£{D I Gn) ^ = Po(A(x)) d^i{x). 

As we shall show in Section^] and Subsections 1 1 6 . 2l and 1 1 6 .11 our model 
includes natural examples of 'scale-free' random graphs, where the degree 
distribution has a power-law tail. We believe that when it comes to mod- 
elling real-world graphs with, for example, observed power laws for vertex 
degrees, our model provides an interesting and flexible alternative to exist- 
ing models based on generating graphs with a given degree sequence (e.g., 
Molloy and Reed [771 [TSj), oi' given expected degrees (e.g., Aiello, Chung 
and Lu P). 

3.5. Distances between vertices. Next, we consider the distances be- 
tween vertices of G„ = G^(n, k„) where, as usual, graphical sequence 
of kernels on V with limit k. Let us write d{v, w) for the graph distance be- 
tween two vertices of Gn, which we take to be infinite if they lie in different 
components. Note that 

\{{vM--div,w)<oc]\=^l^^^^^-^y (3.9) 

where {v,w} denotes an unordered pair of distinct vertices of G„. 

Under certain conditions, we can give upper and lower bounds on d{v, w) 
for almost all pairs with d{v,w) < oo. 

Theorem 3.14. Let k„ be a graphical sequence of kernels on a (generalized) 
vertex space V with limit k, with ||Tk|| > 1. Let Gn = G^ {n, Kn), and let 
e > 6e fixed. 

(i) // K is irreducible, then 

irr ,/ N II Ci(Gn)'^ , 9. p{kS^V? , 9, 

\{{v,w]:d{v,w)<^}\ = ^^^^Oy{r?) = ^-^^ ^ o^ir?). 

(ii) S^Px,y,ra y) < '^y then 

\{{vM-A^M < (1 -e)logn/log||r,||} I =0p(n2). 
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(iii) If n is irreducible and WT^W < oo, then 

\{{v,w} : d{v,w) < (1 +e)logn/log||T«||}| = piKfn'^/2 + Op{n'^). 

(iv) If K is irreducible and ||Tk|| = oo, then there is a function f{n) = 
o(logn) such that 

\{{v,w} : d{v,w) < /(n)}| = p{Kfn'^/2 + 0p{n'^). (3.10) 

Note that part |(i)| is immediate from H3.9() and Theorems 13.11 and 13.61 
Related earUer results are discussed briefly in Section [TH 

In the finite-type non-critical case, we can give an asymptotic formula for 
the 'diameter' of Gn, i.e., for 

diam(G„) := maic{d{v,w) : v,w £ V{G), d{v,w) < oo}, 

the maximum of the diameters of the components of G„. This turns out to 
depend not only on the norm of T^, but also on the norm of the operator 
associated to the 'dual kernel' k. 

Definition 3.15. Let k be a supercritical kernel on a (generalized) ground 
space {S, /x). The dual kernel is the kernel k on the generalized ground space 
{S, fi) defined by k{x^ y) = k;(x, y), with d[i{x) = (1 — p(k;; x)) dfi{x). 

Note that k and k are identical as functions on S x S. However, they 
are defined on different generalized ground spaces. Hence, the operators 
and Tf^ have (in general) different norms. If we wish to consider only 
ground spaces, we may renormalize, defining k' on (5,/t') by k'{x,y) = 
(1 — p{k))k{x, y) and d[ji'{x) = (1 — p{k; x))/{1 — p{k)) dp{x). The choice of 
normalization does not affect the norm of the operator: \\Tf^\ = ||r;^/||. 

The relevance of the dual kernel is that it describes the 'small' components 
of G^(n, k); see Section Wl\ The distribution of these small components is 
essentially the same as the distribution of trees hanging off the two-core of 
the giant component, which affects the diameter of G^(n, k). 

Theorem 3.16. Let k be a kernel on a (generalized) vertex space V = 
{S, p, (x„)), with 5 = {1, 2, . . . , r} finite and p{{i}) > for each i, and let 
G„ = G^{n,K). If 0< \\TJ < \, then 

diam(G„) p 1 
logn logllTKll^^ 
as n —> oo. If ||Tk|| > 1 and k is irreducible, then 
diam(G„) p 2^1 

logn iog||r^||-i iog||r«;||' 

where k is the dual kernel to k. 

Note that we do not require k to be graphical on V: if V is a vertex space, 
then, as S is finite, any kernel k on V is graphical; see Remark 14.51 If V 
is a generalized vertex space, then k need not be graphical. However, by 
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conditioning on the sequences (x„), we can reduce to the vertex space case; 
see Subsection 18.11 

The assumptions of Theorem 13 . 1 61 are much more restrictive than those of 
our other results: we require the type space to be finite. Note, however, that 
even the single type case of this result, concerning the classical random graph 
G(n,c/n), is non-trivial; it answers in the negative a question of Chung and 
Lu [HHl • This special case of Theorem 13.161 was proved independently by 
Fernholz and Ramachandran again as a special case of a result for a 
more general model. The nature of their model makes their proof much 
more difficult than that of Theorem 13. 161 see Subsection 114.21 

3.6. The phase transition. Finally, we turn to the phase transition in 
G^{n,K), where the giant component first emerges. As usual, to study the 
transition, we should vary a single density parameter. Here, it is most nat- 
ural to fix a graphical kernel k on a vertex space V, and to study G^{n, ck) 
for a real parameter c > 0, as in Corollarv 13.21 Instead, we could consider 
random subgraphs of G^{n,K) obtained by keeping each edge, or edge ver- 
tex, independently with probability p, and use p as the parameter; as we 
shall see in Examples 14.101 and l4. Ill all three approaches are equivalent, so 
we shall use the first. 

By Theorem 13.11 the size of the largest component of G^{n,CK) is de- 
scribed by the function p{ck), which is for c < cq := ||Tk||~^ and strictly 
positive for larger c. With V and k fixed, let us denote this function by p{c), 
c > 0. We shall see (from Theorem 16. 4|) that p{c) is continuous on (0, oo). 

Since p{c) = for c < cq but not for larger c, the function p is not analytic 
at Co; in physical terminology, there is a phase transition at cq. 

For the classical Erdos-Renyi random graph G{n,c/n) (obtained with 
K = 1), it is well-known that p is continuous but the first derivative has 
a jump at Co = 1; more precisely, p' jumps from to p+(co) = 2. For 
finite d, we shall say that the phase transition in G^{n,K) has exponent 
k if p{cq + e) = Q{e^) as e \ 0. As we have just noted, in G{n,c/n) 
the phase transition has exponent 1. If p{cQ -|- e) = o{£^) for all k, we 
say that the phase transition has infinite exponent. We are deliberately 
avoiding the physical term 'order', as it is not used in a consistent way in 
this context. In other contexts, discontinuous phase transitions are possible; 
see, for example, Aizenman, Chayes, Chayes and Newman ji2j. 

It was shown in [12] (see also Dorogovtsev, Mendes and Samukhin j^H] and 
Durrett [Sj) that in the case S = (0, 1] and K{x,y) = l/(x V y), the phase 
transition 'is of infinite order', i.e., has infinite exponent (see Subsection ll6.11 
for more details). We shall see in Subsection 116.41 that it is also possible to 
have a phase transition with any finite exponent larger than 1 (including 
non- integer values). 

The next theorem shows that the phase transition has exponent 1 for a 
wide class of kernels k, including all bounded k. We also prove that for this 
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class there is no other phase transition: as /3 = on (0, cq), it is trivially an- 
alytic there, and we shall prove that p is analytic on (cq, oo). As p is defined 
in terms of the branching process, rather than a graph, we do not need a 
vertex space for the statement of the next result; to deduce conclusions for 
graphs of the type we consider, we should let k be an irreducible graphical 
kernel on a vertex space V, satisfying the additional condition (|3.1H) below. 
(Also, there is no need to consider generalized ground spaces, as we may 
trivially normalize so that p{S) = 1 by multiplying k by ijl{S) and dividing 
IJL by the same factor - this leaves the branching process unchanged.) When 
we say that a function / defined on the reals is analytic at a point x, we 
mean that there is a neighbourhood of x in which / is given by the sum 
of a convergent power series; equivalently, / extends to a complex analytic 
function in a complex neighbourhood of x. 

Theorem 3.17. Let k he a kernel on a ground space {S,p). Suppose that 
K is irreducible, and that 

sup / k{x, y)"^ dp,[y) < oo. (3-11) 
X Js 

(i) The function c p(c) := p{ck) is analytic except at cq := ||Tk||~"'^. 

(ii) The linear operator T^ has an eigenfunction ip of eigenvalue \\Tt^\\ < 
oo, and every such eigenfunction is bounded and satisfies 

p(co + e) = 2c^ii^J^e + 0(e2), e > o, (3.12) 
Js r 

so p'j^{cq) = ^Cq"^ jg il) jg'^'^ / > and p has a phase transition 

at Co with exponent 1. 

The proof is given in Section 1151 Note that (|3.11|) implies that k, ^ L?' 
L^. Theorem 13.171 has an easy consequence concerning the extremality of 
the Erdos-Renyi random graphs, also proved in Section 1131 

Corollary 3.18. Let k be an irreducible kernel on a ground space {S,p) 
such that (|3.1H) holds, and let cq := HTkH"^ > 0. Then cop'_^_{co) < 2, with 
equality in the classical Erdos-Renyi case; more precisely, equality holds if 
and only if k{x , y) dp{y) = 1 for a.e. x. 

Let K be an irreducible graphical kernel on a vertex space V; let us as- 
sume (|3.11|1 and, as a normalization, that cq = 1. Letting c increase from 
the threshold cq, Corollarv 13. 181 savs that the giant component of G^(n, ck) 
has maximal growth-rate in the Erdos-Renyi case, and, more generally, in 
the 'homogeneous' case treated in Example 14.61 below. In this example, the 
vertex degrees are more or less the same, so there is no first-order inhomo- 
geneity in the graph; any inhomogeneity in vertex degrees leads to a slower 
growth. 

Remark 3.19. By Theorem 13.51 the number of edges in the giant compo- 
nent of G(n, ck) near the phase transition is asymptotically determined by 
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the behaviour of the function C(ck) as c \ cq. As we shaU show in Propo- 
sition ^3 if II^kII < oo, then Q{ck)/p{ch) ^ 1 as c \ cq := HTkH"^. In 
particular, under the conditions of Theorem 13 .171 there is a phase transition 
of exponent 1 in too. (In addition, the proof of Theorem 18.171 will show 
that c I— > C,{cK,) is also analytic except at cq.) In the case ||Tk|| = oo, when 
Co = 0, it is not always true that C(ck) ~ p{c^^) as c \ cq: this will be shown 
by Example 14.131 An important case when this does hold is described in 
Subsection 116.41 

4. Examples 

In this section we give several simple examples of the random graph model 
we study; these examples are chosen to illustrate the definitions and the 
scope of the model, as well as various pathologies that may occur. In sub- 
sequent sections we shall refer to several of these examples; in particular, 
many of our proofs will be based on the 'finite-type' case. Further examples 
of interest in their own right are discussed at length in Section 1161 as ap- 
plications of our results. We often suppress the dependence on V, writing 
G(n,K) for G^(n,K). 

Example 4.1. The Erdos-Renyi random graph. If k = c is constant, 
then the edge probabilities pij given by 1)2. 3() are all equal to c/n (for n > c). 
Thus any choice of vertex space gives the classical Erdos-Renyi random 
graph G{n,c/n). The simplest choice is to let S consist of a single point. 
Then the operator is simply multiplication by c, so ||Tk|| = c and Corol- 
lary |SI2] yields the classical result that there is a phase transition at c = 1. 
Furthermore, the function p{c;x) reduces to the single value /9(c), and the 
survival probability p(c) of the branching process Xc is given by the formula 

p[c) = I - e-''P^''\ with p(c) > if c> 1; (4.1) 

this classical branching process result is the simplest case of Theorem 16.21 
below. Returning to the graph, in this case Theorem 13.11 reduces to the 
classical result of Erdos and Renyi |17] . 

Example 4.2. The homogeneous bipartite random graph. Set S = 

{1,2}, /i{l} = Ai{2} = 1, and let V = (5,/x, (x„)) be a generalized vertex 
space in which x„ consists of n vertices of type 1 and n vertices of type 2. 
Let K be defined by k(1,1) = k(2,2) = and k;(1,2) = k;(2, 1) = c. Then 
G^(n, k) is the random bipartite graph G(n, n; c/n) with n vertices in each 
class, where each possible edge between classes is present with probability 
c/n, independently of the other edges. While it is natural to use a generalized 
vertex space to describe this example, it is not necessary: the same graph 
can be written as G^(n, k) in another way: take /i{l} = /^{2} = 1/2, and let 
V be a vertex space where x„ is defined only for n even, and then consists 
of m = n/2 vertices of each type. Let k(1,1) = k(2,2) = as before, and 
k(1, 2) = k(2, 1) = 2c, so the edge probabilities are 2c/n = c/m. 



THE PHASE TRANSITION IN INHOMOGENEOUS RANDOM GRAPHS 



25 



Example 4.3. The finite-type case. Let S = {si, . . . , Sr} be finite. Then 
K is an r X r matrix. In this case, G{n, k) has vertices of r different types (or 
colours), say rii vertices of type i, with two vertices of types i and j joined 
by an edge with probability n~^K[i,j) (for n > max/t). The condition (|2.2|) 
means that rii/n ^ in for each i (in probability if the rij are random) , where 
:= > 0. 

This case has been studied by Soderberg |HH1 EHl EOl 1^ j who noted our 
Theorem 13. II in this case (with for all n). 

Most of our proofs will be based on a disguised form of this case, described 
by the following definition. 

Definition 4.4. A kernel k on a (generalized) ground space (5, ^i) is regular 
finitary if S has a finite partition into sets Si, . . . , Sr such that k is constant 
on each Si x Sj, where each Si is a /^-continuity set, i.e., is measurable and 
has iJ-{dSi) = 0. 

Clearly, if k is regular finitary on (5, /x) then the random graph G^(n, k) 
has the same distribution as a finite-type graph G^'(n, k'), V' = (5', /x', (yn))'- 
take S' = {!,..., r}, let yt = i whenever Xk G Si, and define /^'{i} and 
K'{i,j) in the obvious way. Let rii = #{/ : xi £ Si} = nVniSi), where Vn 
is as in (|2.H) . The numbers rii may be random, but since each Si is a fi- 
continuity set, 1)2. 2|) yields rii/n = VniSi) f^iSi), so V' is a (generalized) 
vertex space. 

Remark 4.5. Let us note for later that a finite-type or regular finitary 
kernel k on a vertex space V is automatically graphical on V; conditions |(i) 



and (ii) of Definition 12.71 are trivial in this case, while condition (iii) holds in 
the much more general case of k bounded; see Lemma l8. II This observation 
does not extend to generalized vertex spaces: there may be a very large 
number of vertices with some small probability, so the expectation in 1)2. 9(1 
need not converge, or even exist; see Remark 18.21 

Example 4.6. The homogeneous case. Generalizing the Erdos-Renyi 
and homogeneous bipartite cases above, let (5,/i) be an arbitrary (gener- 
alized) ground space, and let k be such that k{x, y) dfj,{y) is essentially 
independent of x G S, i.e., that 

k{x , y) dfj.{y) = c for a.e. x, (4-2) 

s 

for some constant c. (This says roughly that, asymptotically, all vertices 
have the same average degree.) Then T^l = c a.e., so the constant function 
1 is a positive eigenfunction with eigenvalue c, and thus ||Tk|| = c, and by 
Theorem 13. II there is a giant component (and p{k) > 0) if and only if c > 1. 

Normalizing (if necessary) so that /u(5) = 1, in the branching process, 
apart from particles with types in a measure zero set, which arise in 
with probability 0, the number of children of each particle has a Po(c) dis- 
tribution. Hence, ignoring the types of the particles, the distributions of 
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the process and the single-type process X-c are the same. In particular, 
p{k) = p{c), so p{k) = p{c) is given by (|4.1|) in this case too. If n is irre- 
ducible, the global behaviour of G(n, k) is thus exactly the same as that of 
G{n,c/n), at least in terms of the size of the giant component. The local 
behaviour can be quite different, though. For example, G{n, k) may have 
many more triangles or other small cycles than G{n, c/n); see Example ll7.4l 
On the other hand, by Theorem l3.131 the vertex degrees have an asymptotic 
Po(c) distribution just as in G{n,c/n). 

A natural example of such a homogeneous k is given by taking S as 
(0,1] (now better regarded as the circle T), p as Lebesgue measure, and 
n{x,y) = h{x — y) for an even function /i > of period 1. For example, h 
can be constant on a small interval {—5,5) and vanish outside it; this gives 
a modification of G{n,c/n) where only "short" edges are allowed. 

More generally, S can be any compact homogeneous space, for example 
a sphere, with Haar measure p and an invariant metric d, and k{x, y) a 
function of the distance d{x,y). 

Example 4.7. Take S = (0, 1] with p the Lebesgue measure, and let Xi = 
i/n. Set K{x,y) = l[x + y < i] and consider the kernel ck, so that 



Pij 



c/n, i + j < n; 
0, i + j > n. 



Thus G(n, ck) can be obtained from the random graph G{n, c/n) by deleting 
all edges ij with i + j > n. 

The operator is compact, and it easy to see that it has eigenvalues 
(— l)'^^^"'^ and eigenfunctions cos{uJkx), with = {k + \/2)tt, = 0, 1, . . . . 
Hence \\T^\\ = 1/-k and the critical value is cq = tx/2. Theorem K-i . 1 71 shows 
that at the critical value we have copY('^o) = 3/2. 

Example 4.8. I.i.d. vertices. For any ground space {S,p), we can obtain 
a vertex space by taking xi,...,Xn to be i.i.d. random points in S with 
distribution p. (This has been proposed by Soderberg 88 .) In this case 

Ee{Gin,.))=^^g^<^^dpix)dpiy) 

< ^ jj^^ y) dp{x) dp{y). 
Hence, by Lemma l8. II below. (|2.9j) always holds, and to verify that a kernel 



K is graphical, we only have to check conditions (ij and|(ii)|in Definition 12. 71 



Similarly, for a sequence of kernels, (|2.11|) holds provided JJ Kn ^ JJ k. 

Example 4.9. Poisson process graph. For any generalized ground space 
(5, p) and any A > 0, let xx = (xi, . . . , x^^) be the points of a Poisson pro- 
cess on S with intensity measure Xp. In other words, v\ has a Poisson 
distribution Fo{Xp{S)), and, given vx, the points Xi are i.i.d. as in Exam- 
ple l4.8l Then (5, p, (x^)) is a generalized vertex space. Here, it is natural to 
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write A rather than n for an element of the index set / = (0, oo). Note that 
(|2.4j) holds because Xi'\{A) ~ Po(Xfi(A)). This is the canonical example of a 
generalized vertex space, and one of the main reasons for allowing a random 
number of vertices. 

Let K be a kernel on so, given x;^, the edge probabilities in the 

graph G^(A, k) are given by 

Pij = mm{K{xi,Xj)/X, l}, 

for 1 < i < j < vx- As in Example 14.81 1)2. 9|) always holds. To see this, 
let V = {S,fi', (yn)n>i), where fi' = fi/fj,{S) is the normalized version of fi, 
and Yn consists of n i.i.d. points of S chosen with distribution /i'. Given that 
vx = n, the distribution of G^(A,k) is exactly that of {n, {n/ X)k). In 
particular, as 1)2. 9|) holds for the latter graph, 

E(e(G^(A, K))\vx = n)^^ [[ ^ df,'{x) dfi'{y) 



52 A 

K{x,y)dn{x)dfi{y). 



As A —> 00 we have lK{vl) ~ (A/u(5))2, and follows. Hence, as m 

Example 14. 8( a kernel k on (5, fi) is graphical on V if and only if conditions 



(i) and (ii) of Definition 12.71 hold. 

In this Poisson process example, it is easy to see that allowing a random 
number of vertices makes the model only superficially more general. Indeed, 
renormalizing so that fJ-{S) = 1, since vx ~ Po(A), we can regard vx as 
a random function of A, which is increasing (a Poisson process), and then 
vx/X 1 as A ^ 00. It follows that if we condition on the process vx{X), 
then Theorem 13.11 applies a.s. to the corresponding graphs {vx, {vx/X)k) 
on the (ungeneralized) vertex space V'. Thus, conditioning on vx{X), 

A^iCi(G^(A,/€)) ^p(k) as a ^00 (4.3) 

holds a.s. It follows that ()4.3() holds unconditionally too. Other properties 
can be treated similarly. We shall see later, in Subsection 18.11 that all our 
results for generalized vertex spaces can be reduced to the vertex space case. 

Example 4.10. Edge percolation. Let k be an irreducible graphical ker- 
nel on a (generalized) vertex space V with \\Tfi\\ > 1, and let < p < 1. 
Independently of everything else, keep each edge in G{n, k) with probabil- 
ity p and delete it with probability 1 — p. Denote the resulting graph by 

This random graph G^\n, k) is nothing but G{n, kn), where 

kn{x,y) ■.= p{K{x,y) An). 

Clearly, Xn ^ x and yn ^ U imply kn{xn,yn) pn{x, y), provided (x, y) is a 
point of continuity of k. Furthermore, ^ Ee(G^''^(n, k)) = ^ Ee(G(n, k)) 
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Po JJ 1^- Hence, ( graphical sequence with limit pn, so Theorem 

applies with k replaced by pn^ and 

n-^Ci{G^'P\n,K)) ^ p{pn). 

In particular, G^'^\n,K) has whp a component of order G(n) if and only 
if \\Tpi^\\ > 1, i.e., if p > ||Tk||^"'^. Thus, as expected, we obtain the same 
threshold for edge percolation in k) (meaning that there remains a 
giant component) as for the existence of a giant component in G{n,pK); see 
Corollary O 

Of course, the same conclusions follow if we start with the more general 
setting of Definition 12.91 

Example 4.11. Vertex percolation. Again, let k be an irreducible graph- 
ical kernel on a vertex space V = {S, fi, (x„)) with ||Tk|| > 1, and let < p < 
1. Independently of everything else, keep each vertex in k) with prob- 

ability p and delete it with probability 1 —p- Denote the resulting graph by 
G^\n, k). This graph is again an instance of our model with a generalized 
vertex space. Indeed, writing y„ for the subsequence of x„ corresponding to 
the vertices that were not deleted, V' = {S,pfj., (yn)) is a generalized vertex 
space, and G^\n,K) has exactly the distribution of G^ {n,K). Since the 
kernel k is graphical on V, and E(e(G[Pl(n, k))) = E(e(G^(n, k))) , the 
kernel k is also graphical on V', so our results apply to G^' {n, n) and hence 
to 

Here, one must be a little careful with the normalization: the norm of 
defined with respect to is p times \\Ti^\\, the norm defined with 

respect to (5, //). In particular, Theorem 13.11 tells us that G^\n^ k) has whp 
a component of order 0(n) if and only if p||Tk|| > 1, i.e. if p > ||Tk||~^. We 
thus obtain the same threshold for vertex percolation in k) as for edge 
percolation in Example 14.101 

Once again, we could have started with the setting of Definition \2.S)\ we 
could also have started with a generalized vertex space. 

Note that we can obtain the Poisson graph G\{k) in Example 14.91 as a 
limit of the vertex percolation model G'^I (n, k) in Example 14.111 if we take 
p = \/n and let n — > oo. 

Our next example shows that even in the supercritical, irreducible case, 
the second largest component may be rather large - certainly much larger 
than O(logn) as in the Erdos-Renyi case. 

Example 4.12. Large second component. Let S = {1,2,3,...} with 
= , and let xi, . . . , x„ be i.i.d. random points in S with distribution 
//. Let (sfc)^ be a sequence of positive numbers tending to zero, to be 
chosen below. Set k) = 2^^^ for A; > 1, k(1, k) = K{k, 1) = Sk for 
k > 2, and K,{i,j) = otherwise. Note that k G L^{S x S,fi x p); as 
noted in Example 14. 8| from our choice of Xi it follows that k is graphical on 
(x„)„>i). 
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For each k > 1, the graph G{n,K) contains ~ Bi(n, 2"^^) vertices of 
type k, forming a random subgraph Hk which has the distribution of the 
Erdos-Renyi graph G{nk,2^~^^ /n). Each potential edge between Hi and 

1/2 

H^. is present with probabihty e^/n. Note that = n/c + Op{nj^ ), where 
flfc = Erifc = and thus each iJ^ is (whp) supercritical. In particular, 

whp Ci(G(n, k)) > Ci{Hi) > cn for some c > 0, so G{n, k) is supercritical. 

Let kn — > oo with log2 n — kn — > oo, so that n^^ — > oo. Let us choose the 
Ek so that < Then the expected number of edges between Hi and 
is E{nini:„)ek„/n < nek„ 0, so whp Hk„ is isolated in G{n,K). As 

1/2 

nk„ = nk„ + Opijij!^ ), we may couple the k) for different n so that 

nfc„=nfc„+0(n^f) (4.4) 

holds a.s. (Here the implicit constant is random.) We may then condition 
on nk„, assuming that n^^ is deterministic, and that (|4.4j) holds. 

Clearly, H^^ is a uniform Erdos-Renyi random graph G{nk„,2^'^~^^ /n). 
As 2''"~^^/n ~ 2/nfc„, this graph is supercritical (for large n), and has a 
largest component of order (c + Op(l))nfc^ for some constant c. Thus, 

Ci{Hk„) = (c + Op(l))nfe„ = (c + Op(l))nfc„ = (c + Op(l))n/2'=" . 

Given any function uj{n) with a;(n) = o(n), we can choose A:„ so that 
2^'-Lo{n)/n 0; it follows that whp C2(G(n, k)) > Ci(i?fcJ > (c/2)n/2^" > 
u;(n). Thus, the Op(n) bound in Theorem 13.61 is best possible. 

The final example in this section shows that when ||Tk|| = oo, the ratio 
of the number of edges to the number of vertices in the giant component of 
G{n, ck) need not tend to 1 as c — > 0. In fact, it may even tend to oo. 

Example 4.13. Dense giant component. Let 5, and x„ be as in 

Example EH and let k(1,/c) = K{k,l) = 1 for > 1, K{k,k) = A^/k"^ for 
k > 2, and = otherwise. Again k G L^, so k is graphical. Let c > 

be small but fixed and consider G^ {n^cn). Let /cq be the smallest integer 
such that 2^° /kQ > 1/c; taking c small enough, we may assume that ko > 10. 

Using the notation of Example 14.121 if > 2, then H^ forms a random 
subgraph of the type G(nk,cA^ / {k'^n)). Since Ukd^ /{k'^n) ?s c2*^/fc^, this 
subgraph is a supercritical Erdos-Renyi graph if > A;o, and if A; > k^ + \, 
classical results show that whp H^ contains a component of order Q{nk) = 
Q{2~^n) with Q{n^cAh / {k'^n)) = Q{nc/k'^) edges; throughout this example 
the implicit constants in ©(•) and O(-) notation do not depend on c. Each 
of these components is whp of order n, so they are subsets of the giant com- 
ponent of G^{n, k). Summing over k = ko + l, . . . , 2/co, the giant component 
thus has at least @{nc/ko) edges, so C(ck) = Q{c/ko); see Theorem 13.51 

To bound the number of vertices in the giant component, condition on 
xi, . . . ,Xn and say that a vertex of type k is light ii k < ko — 3, and heavy 
otherwise. The total number of heavy vertices is 0{n2~^°) whp. Further- 
more, it is easy to check that if c is small enough, then the expected number 
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of edges to light vertices from each heavy vertex is at most 1/2, as is the 
expected degree of each hght vertex. Each Ught vertex in the giant com- 
ponent has to be connected to some heavy vertex by a path whose other 
vertices ah are hght. As the expected number of such paths starting at a 
given heavy vertex is at most J2i>ii^/'^y — 1' expected number of hght 
vertices in the giant component is 0(n2^'^") too. Hence, the number of ver- 
tices in the giant component is Op{n2~'">), so p{ck) = 0(2"'^") = 0{c/k'^). 
Consequently, C,{ck)/ p{ck) = r2(A;o) = ri(log(l/c)), as c ^ 0. In particular, 
({ck)/ p{ck) — > oo as c \ Co = 0; see Remark 13.191 

5. Branching process lemmas 

In this section and the next we study the Poisson branching processes 
Xk{x) and defined in Subsection 12.11 and their survival probabilities. 
These turn out to be given by the solutions to a certain non-linear functional 
equation (|5.3j) . Let us briefly recall some definitions. 

Let be a (generalized) ground space. The branching process Xk{x) 

is a multi-type Galton- Watson branching processes with type space S: a 
particle of type y £ S is replaced in the next generation by its 'children', 
a set of particles whose types are distributed as a Poisson process on S 
with intensity K{y,z)dp{z). The zeroth generation of consists of a 

single particle of type x. Note that the distribution of Xk(x) is unaffected 
if we multiply k by a constant and divide p by the same constant; thus, we 
may assume without loss of generality that p{S) = 1. We shall make this 
assumption throughout this section. In this normalized case, the branching 
process is just the process ^^(x) started with a single particle whose 
(random) type is distributed according to the probability measure p. 

Here, we have no need for the metric or topological structure of S\ in 
this section and the next, S can be any measurable space equipped with 
a probability measure p. We assume, as before, that the kernel k is a 
measurable symmetric non- negative function on S^. We shall also assume 
that K G L^(5 X X p), i.e., that JJ k <oo. 

Let us recall our notation for the survival probabilities of particles in 
Xk(2;). We write pk{K;x) for the probability that the total population con- 
sists of exactly k particles, and p>fc(«;;x) for the probability that the total 
population contains at least k particles. Furthermore, p{k; x) is the proba- 
bility that the branching process survives for eternity. 

We write Pki^), P>k{i^) and p{k,) for the corresponding probabilities for 
Xk, so that, e.g., pk{K) = fg pk{K;x) dp{x). 

We start with a trivial observation that will enable us to eliminate certain 
pathologies. 

Lemma 5.1. If k = k' a.e., then p{k;x) = p{k';x) and /9>fc(K;x) = 
p>k(,K';x) hold for a.e. x; hence p{k) = p{k!) and p>k{i^) = P>kW)- 

Proof. There is a measure zero set N G S such that x ^ N, then k{x, y) = 
K'{x,y) for a.e. y. It follows that if we start the processes X^ and X^' at 
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the same x ^ N, the processes will be identical in distribution. Hence 
p{k;x) = p{k';x) and p>k{K]x) = p>k{K';x) for all x ^ N, and the result 
follows from ^J^. □ 

For the sake of convenience, in this section we impose one more assump- 
tion on K, namely that 

K{x,y)dn{y) < oo (5.1) 

for every x € 5. This assumption loses no generality, as 1)5. 1(1 holds for 
a.e. X, since JJ n < oo. Writing for the measure zero set of x such that 
H5.1() does not hold, define R by setting R{x, y) = if x £ N oi y £ N, and 
R{x,y) = K{x,y) otherwise. Then k = R a.e., so by Lemma 15.11 we have 
p{R) = p{k) and so on. 

All the assumptions above apply to all the kernels considered below, de- 
noted Ki, k' , etc. In this section, unless explicitly stated, we do not assume 
that K is irreducible. 

Remark 5.2. Condition (|5.1|) means that a particle of type x has a finite 
number of children in the branching process. As we are assuming (|5.1j) for all 
X, a particle survives for eternity (has descendants in all future generations) 
if and only if it has infinitely many descendants. In other words, p(k; x) = 

Poo(^j 

Remark 5.3. Our process is a very special branching process since we 
assume that the children of a particle are distributed according to a Poisson 
process; in particular, the number of children has a Poisson distribution. 
Other branching processes, and other functional equations, appear when 
studying random graphs with dependencies between edges, as in [22 l l25l l8l| . 
but will not be considered here. 

Note also that even with the Poisson assumption, our processes are spe- 
cial. For the branching process, there is no reason to assume k, to be sym- 
metric; moreover, p may be any cr-finite measure, and the hypothesis k £ 
could be weakened to 1)5. 1|) for a.e. x (or perhaps removed completely). We 
shall, however, consider only the special case just defined; this will be useful 
in the proofs. We have not yet investigated to what extent the results gen- 
eralize and remark only that in non-symmetric situations, the norm \\Ti^\\ 
should be replaced by the spectral radius. 

There is an abundant literature on branching processes with different 
types; see, for example, the book by Mode 76_. However, we have not 
found the results we need in the generality required here, so for the sake 
of completeness we give full proofs, although the results are only minor 
extensions of known results; see, for example, [TBI Chapter 6]. 

We start with the connection between our branching process and the 
operator defined in 1)2. 17|) . 
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Lemma 5.4. Consider the random offspring of a single particle of type x; 
let N be the number of children, and denote their types by If 9 is a 

measurable function on S with < 5 < 1, then 

N 

IEn(l - am) = e~(^«3)(^-) = 1 - {^.g){x). (5.2) 

i=l 

Proof. This is a standard formula for Poisson processes; see, for example, 
Kallenberg |6H Lemma 12.2(i)], taking / = — ln(l — g). For completeness, 
we include the simple proof. Indeed, let v = Vxhe the measure defined by 
dv{y) = k{x, y) dn{y). Then N ~ Po(z^(5)) and, given A'', the types of the 
children are i.i.d. with the renormalized distribution z^' = v/v{S). Hence, 
given A^, the conditional expectation of ^^^^^(l — g{Ci)) is just 

N 

nE(i - 5(^0) = (iE(i -5(6)))'^ 

i=l 

N 



(1 - j^g{y)dv'{y)^ = (l - {T,g){x)/u{S)) 



N 



Using P(A^ = n) = e '^^^^v(S)'^ /n\ and taking the expectation, the result 
follows. □ 

Our next aim is to study the fixed points of i.e., the solutions of the 
equation 

/ = 1 - e-^-f, (5.3) 

where / is a non- negative function on S. 

Remark 5.5. li f = g a.e., then = ^kQ- In particular, if / = a-e., 
then $k/ = ^ni^nf)'-, thus, if / satisfies (|5.3|) a.e., then there is a solution / 
to ()5.3|) (viz. $k/) such that f = f a.e. This shows that it makes no essen- 
tial difference if we require 1)5. 3|1 to hold only a.e. (which might be natural 
from an perspective). We shall, however, find it convenient to interpret 
(|5.3() and similar relations as holding everywhere unless we explicitly state 
otherwise. Similarly, if e., then for any solution / to / = there 

is a unique / with the properties that / = <I>k/ and f = f a.e. 

Note that is monotone: if < / < (7 a.e. then T^/ < Ti^g and thus 

^nf < ^ng- 

In the lemma below, 1 denotes the function with constant value 1. 

Lemma 5.6. (i) For m > the probability that a particle of type x has 
descendants in at least m further generations is (<I'™l)(x). 
(u) As 00, (<I>™l)(x) \ p{k;x). 

(iii) The function = Pn{x) = p{k; x) is a solution of (|5.3|) . i.e., satisfies 

^kPk = Pk- 

(iv) The function is the maximum solution of (|5.3|1 ; if f is any other 
solution, then Pk{x) > f{x) for every x. 
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Proof, (i) Let gm{x) be this probability. Then, with g = g^, the left-hand 
side of (|5.2|) is the probability that none of the children of x has descendants 
in at least m generations, i.e., the probability 1 — gm+iix) that x does not 
have descendants in m + 1 generations. Thus gm+i = ^Kg-m, and the result 
follows by induction, since go{x) = 1. 

(ii) An immediate consequence of (i). 

(iii) This follows by the same argument as (i) (and is also a consequence 
of (ii) and dominated convergence). 

(iv) Suppose that / is a solution of (|5.3j) . Then / = < Ij and thus 
/ = '&™/ < *1*^1 for every m. Hence, f < Pk follows from (ii). □ 

Remark 5.7. If we do not impose ()5.1|) for all x, then (iii), i.e., {^kPk){x) = 
Pk{x), could fail for x in the measure zero set for which 1)5. 1(1 does not hold. 
This is because a particle of type x has infinitely many children, which 
may have finite but unbounded lines of descendants; for an example, take 
S = (0,1] and k = 1 except that k(x, 1) = k{1,x) = 1/x. 

We continue to study the functional equation 1)5. 3() . 

Lemma 5.8. Suppose that / > with f = Then 

(i) < / < 1; 

(ii) Tk/ > /, with strict inequality when f{x) > 0; 

(iii) Tk/ <//(! — /), with strict inequality when f{x) > 0; 

(iv) if K is irreducible, then either / = everywhere or / > a.e. 

Proof (i) We have f{x) = l-e'^^-f^^^"^ < 1. Hence, {T^f){x) < (r«l)(x) = 
K{x,y) dn{y) < oo, where the second inequality is just our assumption 
(EU). Therefore, f{x) = 1 - e-(^«^)(^) < 1. 

(ii) This is immediate from / = 1 — e""^*"-^ ^Tf^f, with equality only when 
TJ = 0. 

(iii) We have e"^"*-^ = 1 — /, and thus, as / < 1, 

TJ < e'^-f - 1 = - 1 ^ 



1-/ 1-/ 
with equality only when T^/ = 0. 

(iv) Let A:= {x £S : f{x) = 0}. For x £ A, {^^f){x) = f{x) = 0, and 
thus {Ti^f){x) = 0. Hence K{x,y) = for a.e. y ^ A. Consequently, k = 
a.e. on ^ X (5 \ A), which by ((TT^ implies p{A) = or p{A) = 1. In the 
latter case, / = a.e., and thus / = ^^f = 0. □ 

In the next two lemmas we consider irreducible k. 

Lemma 5.9. Suppose that n is irreducible. Suppose further that f = $^7 
and g = ^^g with < f < g. Then either / = or f = g. 

Proof. By Lemma I5.^f iv)| we may assume that / > a.e. 

Let h = {g- f)/2>0; thus f + h = {f + g)/2. The function t ^ I - e'^ 
is strictly concave; in particular, 1 — e^*^*+"^/^ > ^ ((1 — e~*) + (1 — e"")). 



34 BELA BOLLOBAS, SVANTE JANSON, AND OLIVER RIORDAN 

Hence, 

$ f f + 9 \ ^ _ ^_T«((/+g)/2) ^ ^ _ g-(T,/+T«g)/2 

2 ; (54) 

> 1 ((1 - e-^^f) + (1 - e-^''^)) = lif + g) = f + h, 

with strict inequality at every point where f < g and thus < and 
Tk/ < Ti^g. On the other hand, 

^ - ^'^(^) = = ^'^^^^'^^"^ = (1 - /)«"'^"' (5 5) 

> (i-/)(i-r,/i). 

Combining (|5.4() and ()5.5|) . we find that 

(1 - /)(1 - T^/i) < 1 - (/ + /i) = 1 - / - /i 

and thus 

(1 - f)T^h > h (5.6) 
with strict inequahty when g > f . 

Suppose now that g > / on a set of positive measure. Then, inequahty 



(|5.6j) . the fact that / > a.e., and Lemma I5.^riii)| imply that 

Note that the integrals above are finite because k, ^ L} and f,h< 1. How- 
ever, as K is symmetric, is a symmetric operator, and so f Tf^hdfi = 
hT^f dfi, contradicting (|5.7j) . This shows that g = f a.e. and thus 
f = ^^f = ^^g = g. □ 

Lemma 5.10. Suppose that k is irreducible. Then / = and f = Pk ol^^ 
the only solutions to 1)5. i/iese solutions may coincide. 

Proof. By parts (iii) and (iv) of Lemma 15.61 the function />k is a solution of 
1)5. 3(1 . and </</?«; for every solution / of ()5.3|) . The result follows by 
Lemma 15.91 □ 

It remains to decide whether = or not. Recall that \\T^\\ is defined 
in 1)2. 15(1 . We shall show that = if and only if \\T^\\ < 1. 

Lemma 5.11. // ||Tk|| < I, then = 0. 

Proof. Suppose that / is a solution of 1)5. 3|) . and that we do not have / = 
a.e. Lemma I5.^hi)| implies that T^/ > /, with T^/ > / on a set of 
positive measure, and hence that ||Tk/||2 > ||/||2) contradicting \\Tf^\\ < 1. 
Consequently, if / is a solution of H5.3|l . then / = a.e., and thus / = ^^f = 
0, so the only solution is / = 0. In particular, pn = since p^ is a solution 
by Lemma 15.61 □ 

It remains to show that if IIT^H > 1, then is not identically zero. We 
proceed in several steps. 
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Lemma 5.12. // / > and <I>k/ > /, then g as m ^ oo, for some 

g> f>0 with ^^g = g. 

Proof. By induction, / < <^^f < <^lf < .... Since < < 1, the 

limit g{x) := limm^oo(*J'™/)(^) exists for every x, and g > 0. Monotone 
convergence yields 

iT^g){x)= lim / K(x,y)($-/)(y)d^(y)= lim (T, (x) 

m— >oo J ^ m— >oo 

and thus 

i<^.g){x) = lim ($,($™/))(x) =5(x). n 

Lemma 5.13. // there is a bounded function / > 0, not a.e. 0, such that 
Tuf ^ (1 + ^)f for some 6 > 0, then p,^ > on a set of positive measure. 

Proof Let M = sup/ < oo. Fix e > with (1 - Me)(l + 6) > 1. Since 
— log(l — x) < x/(l — x) we have 

- ln(l - ef) < YT^^f < (1 + ^)^f < ^TJ, 

and thus 

'^.{ef) = 1 - e-'^-f > 1 _ (1 _ e/) = ef. 
By Lemma 15.121 there exists a solution g to <^k<7 = 9 with g > ef, and thus 
g not a.e. 0. By part (iv) of Lemma 15.61 we have Pk ^ g- D 

Remark 5.14. The proof of Lemma 15.131 shows that Pk > TT^sup/' 
particular, this immediately implies Theorem 10 of |19j . 

We should like to find an eigenfunction of with eigenvalue greater 
than 1, so that we can apply Lemma 15.131 If the Hilbert-Schmidt norm of 
Tk (see p.l6|l ) is finite, then a standard result gives us such an eigenfunction. 

Lemma 5.15. // HT^Hi/s < oo, then is compact and has an eigenfunc- 
tion £ L'^{S), V' ^ 0, with eigenvalue \\TfJ\. 

If, in addition, k is irreducible, then tp > a.e., and every eigenfunction 
with eigenvalue \\Tk\\ is a multiple ofip. 

Proof. Suppose that HT^Hhs' < oo. It is well-known (see e.g. ^1 XIV. 6, 
p. 202]) that is then compact, and so has an eigenfunction g £ with 
eigenvalue of modulus A := ||Tk||. Then 

T,,\g\ > \T^g\ = X\g\ a.e., 

and since ||Tfj|| = A we must have Tf^\g\ = X\g\ a.e. Hence ■0 •= \g\ is an 
eigenfunction with eigenvalue A = ||Tk||. 

Now suppose that k is irreducible, with ||Tk||hs' < c>o, and let h be any 
(real) function in with T^h = Xh a.e. By the argument above, Tf^\h\ = 
X\h\ a.e. holds as well. Let A := {\ h\ = }. Then T^\h\ = X\h\ = a.e. on A, 
so K = a.e. on A X [S \ A) and (|2.12|) yields p{A) = or 1. Hence either 
h = a.e. or /i / a.e. In particular, taking h = g we see that ip > a.e. 
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Returning to a general h satisfying T^/i = Xh a.e., as T^d/il + h) = 
\{\h\ + h) a.e. by linearity, we can apply the argument above to \h\ + h, 
deducing that either h > a.e. or /i < a.e. Finally, applying this to 
h — aip, with a chosen such that J{h — aip) = 0, we see that h — atp = 
a.e. □ 

The second part of Lemma 15.151 will be needed only in Section [T31 
After this preparation, it is easy to show that if ||Tk|| > 1 then > on 
a set of positive measure. 

Lemma 5.16. If 1 < \\Tf^\\ < oo, then > on a set of positive measure. 
Thus (|5.3|) has at least one non-zero solution. 

Proof. Since \\Tf^\\ > 1, there is function f & with ||/||2 = 1 and ||rK/||2 > 
1. As TkI/I > ITk/I, we may assume that / > 0. Let T/v be the integral 
operator on S with the truncated kernel K]\f{x,y) := K{x,y) A N, N > 1. 
By monotone convergence, T/v/ /" T^/ as ^ oo, and thus ||TAr/||2 
||2^k/||2 > 1- We can thus choose an A^ such that ||r/v/||2 > 1 = H/lb, and 

thus IITtvII > 1. Set 6 = \\Tn\\ - 1 > 0. 

Since the kernel kn is bounded and /u is a finite measure, by Lemma 15.151 
Tat has an eigenfunction E L'^{S) with V' ^ and 

TniI> = \\TnU = {1 + 5)^. (5.8) 

Since the kernel is bounded, it follows that T^^ip is a bounded function. 
Indeed, {TniI)){x) < Nj^ipdfi = A^HVlli < A^ll^lb < oo. From (|EHI) it 
follows that ip is bounded. 

Since k > k^t > 0, we have, using (|5.8j) again, 

T,i^ > Tn^^ = (1 + 
and the result follows by Lemma l5.13l □ 

The final lemma of this section will enable us to reduce the reducible case 
to the irreducible one. 

Lemma 5.17. Let k be a symmetric measurable function on S x S. Then 
there exists a partition S = UiLo'^» with < N < oo such that each Si is 
measurable, n{Si) > for i > 1, the restriction of k to Si x Si is irreducible 
for each i > 1, and k = a.e. on {S x S)\ Ui^i('5i x Si). 

Note that k = a.e. on Sq x Sq. 

Proof. Let G be the family of all measurable subsets A <^ S such that k = 
a.e. on A X (5 \ A). It is easily verified that ^ is a cr-field; thus {S,G,p,) 
is a finite measure space. Hence there exists a partition S = IJi=o'^« with 
< A^ < oo and each Si € Q such that for each i > 1 the set Si is an atom 
in {S,Q,fj,) with positive measure, while Sq is non-atomic, i.e., contains no 
atoms with non-zero measure. (We allow Sq = 0.) Here '5^ is an atom' 
means that if vl C 5j with A G Q, then fJ,{A) = or fi{A) = fi{Si); this is 
equivalent to (|2.12|1 . so k is irreducible on Si x Si for each i > 1. 
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Finally, since Sq is non-atomic, for every positive integer M there exists 
a partition Sq = Ujli with Tj € Q and iJ-{Tj) = ii{Sq)/M. Then k = 
a.e. on Tj x Tj when i ^ j, and thus 



M 



(/zx^){(x,y) G cSoX^o : K{x,y) / O} < (^u x /x) ( J(r,xT,)) = m( 

i=i 

Letting M — > oo, we see that k = a.e. on 5o '^q. □ 



Remark 5.18. One application of Lemma f5.17l is a generalization of Lemma 
IS.lUl to arbitrary k. With Si as in Lemma 15.171 let J be the set of indices 
i such that the restriction of the operator to L?'{Si) has norm strictly 
greater than 1. Then there are 2l'^l solutions of 1)5. 3() . where < | J| < oo: 
for every subset J' C J, there is exactly one solution that equals a.e. on 
(J^gj, 5j and vanishes a.e. elsewhere. (This is easily seen using the argument 
in the proof of Theorem 16.41 below.) 

6. Branching process results 

In this section we collect the branching process results we shall use. These 
are all simple consequences of the lemmas in the previous section. In this 
section, k will always be a kernel on a measure space i.e., a sym- 

metric non- negative measurable function on 5 x 5. Unless explicitly stated 
otherwise, /u will be a probability measure, i.e., p{S) = 1. We shall assume 
that K S L^; as noted in the previous section, it follows that ()5.1|) holds a.e. 
X. We do not assume that (|5.1|1 holds for every x except when explicitly 
stated. 

Theorem 6.1. Suppose that n is a kernel on the space {S,p), that k £ , 
and that ()5.H) holds for every x. Then the function defined by Pn{x) = 
p{k;x) is the maximum solution of (|5.3|) . Furthermore: 

(i) // ||Tk|| < I, then p{k;x) = for every x, and (|5.3jl has only the 
zero solution. 

(ii) If I < ||Tk|| < oo, then p{k;x) > on a set of positive measure. If, 
in addition, k is irreducible, then p{k;x) > for a.e. x, and p{k;x) 
is the only non-zero solution of 

In particular, p(k) > if and only if ||Tk|| > 1. 

Proof. The first statement is just part (iv) of Lemma 15.61 The remaining 
statements follow directly from Lemmas 15.111 [^?TH1 Lemma l5.^[iv)| and l5.1UI 

□ 

The next result is essentially a restatement of Theorem 16.11 in the setting 
of the results in Section |31 Thus, p will not necessarily be a probability 
measure, and we shall not require that (|5.H) holds; this makes very little 
difference. This result gives the promised characterization of p{k; x) and 
p{k) in terms of a functional equation, in the full generality of the setting 
of Theorem 13.11 



38 



BELA BOLLOBAS, SVANTE JANSON, AND OLIVER RIORDAN 



Recall that (|2.17j) defines only for non-negative functions; we thus 
consider only non-negative solutions to (|6.1j) below. 

Theorem 6.2. Let n he a kernel on a (generalized) ground space {S, /x), with 
K G L^{S X S,fix fi). There is a (necessarily unique) TRdXlTflUTYl solution 
to 



i.e., a solution that pointwise dominates all other solutions. Furthermore, 
p{n;x) = Pk{x) for a.e. x, and 



where the function is defined by Pk{x) := p{k;x). 

If \\Tk\\ < 1; then is identically zero, and this is thus the only solution 
to H6.1() . If ||Tk|| > 1, then is positive on a set of positive measure. Thus 
p{k) > if and only if ||Tk|| > 1. 

If \\Tk\\ > 1 <ind K is irreducible, then is the unique non-zero solution 
to (|6.1|) . and p^^ = p^ > a.e. 

Theorem l6.'2l follows almost immediately from Theorem l6. H and Lemma fS-ll 

Proof. Multiplying k by a constant factor and dividing p by the same con- 
stant factor does not affect the definition of the branching process 
Hence, the function p^ is not affected by this rescaling. As the operators 
and $ 1^ are also unchanged, we may assume without loss of generality that 
p{S) = 1. As noted in Sectional since k £ there is a kernel k with R = k 
a.e., such that ()5.1() holds for R for every x. Applying Theorem 16.11 to the 
kernel R, the result follows by Lemma l5. H and Remark 15.51 □ 

We now study monotonicity and continuity properties of p{n; x) and p{n) 
when K is varied. For the rest of the section, we assume that p{S) = 1. As 
usual, we say that a sequence of functions fn increases ( a. e.) to a function 
/ if for every x (a.e. x) the sequence fni-c) is monotone increasing and 
converges to f{x). As before, we write p^ for the function given by Pk{x) ■= 
p{k;x). We start with a trivial lemma. 

Lemma 6.3. // ki < K2, then p^^ < p^^.^. 

Proof. Immediate by coupling the branching processes. □ 

Theorem 6.4. (i) Let [i^n)]^ be a sequence of kernels on (S, p) increasing 
a.e. to K. Then Pk„ Pk for a.e. x and p{Kn) y p{k). 

(ii) Let {nn)]^ be a sequence of kernels on {S,p) decreasing a.e. to k. 
Then p^,, \ p^ for a.e. x and p{Kn) \ 

Proof. On the measure zero set where k„ -/-> k, redefine all k„ and k to be 
0. By Lemma l5.H this does not affect the conclusions, so we may assume 
^ K or \ K everywhere. Similarly, we may assume that (|5.H) holds 
for every x, for each k„ and for k. It suffices to prove the conclusions for 
p^: the conclusions for p(k) follow from (|2.13p and dominated convergence. 



(6.1) 



^k(Pk) = Pk a.e. 



(6.2) 
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(i) We choose a partition S = (Ji=o'^« ^ ™ Lemma l5.17| and redefine k 
and all «:„ to be on [S x 5) \UiLi('5i x 5^); this only changes the kernels on 
a set of measure zero, so we may again apply Lemma l^lTl Now = Pk = 
on Sq. We may consider each Si, i > 1, separately, and we may thus assume 
without loss of generality that k is irreducible. The only problem is that 
the restriction of fj, to Si does not have total mass 1, but this is not a real 
problem, and can be handled by renormalizing, i.e., dividing the measure 
by fJ.{Si) and multiplying all kernels by the same factor; as remarked earlier, 
this operation does not affect the branching process. 

We have shown that we may assume that k. is irreducible; let us do so. By 
LemmaESl if m < n, then p^^ < Thus {pk„) is an increasing sequence 
of functions, all bounded by 1, so the limit p%{x) := lim^^oo Pk„ (2;) exists 
everywhere. By monotone convergence. 



so ^>K/0* = lim„^oo ^k„/Ok„ = liuin^oo Pk„ = P%- Hence, by Lemma EUHl 
either /?* = p^, and we are done, or p* = 0. In the latter case, each = 0, 
and thus, by Lemma 15. 161 \\Tk„\\ < 1. 

Hence, if f e with / > and ||/||2 < 1, then ||r«;„/||2 < 1. Monotone 
convergence shows that, as n ^ 00, T^^f / T^,f and ||Tk„/||2 y \\Tf,f\\2. 
Consequently, ||Tk/||2 < 1 for each such /, and thus ||Tk|| < 1. By Theo- 
rem = in this case, so = p* in this case too. 

(ii) This is similar. Now {pk„) is a decreasing sequence of functions, 
and p%{x) := lim^^oo /Ok„ (2;) still exists everywhere. By dominated con- 
vergence, {Tf,pl){x) = limn^oo{Tf,„pKj{x), so = lim„^oo = 
liuin^oo Pk„ = P*K- In other words, /?* satisfies (|5.l^j) . Furthermore, by 
Lemma [6.31 again, p^^ > Pk, so p* > Since p^ is a maximal solution 
to (|5.:-{j) by Lemma p* = p^. □ 

Theorem 6.5. (i) Let (Kn)i° be a sequence of kernels on {S,p) increasing 
a.e. to K. Then, for every k > 1, p>k{i^n'-,x) p>k{K;x) for a.e. x and 

P>k{Hn) / P>k{K). 

(ii) Let {Kn)T be a sequence of kernels on (5,p) decreasing a.e. to k. 
Then, for every k > 1, p>fc(K„;x) \ p>fc(K;x) for a.e. x and p>k{K,n) \ 
P>k{f^)- 

Proof. As in the proof of Theorem 16.41 we may assume that Kn k or 
Kn\ K everywhere, and that ()5.1|) always holds. 

(i) Let Ko '■= and Ak„ := k„ — n > 1. The children of a particle 

of type x are given by a Poisson process with intensity n{x,y) dp{y) = 
Ak„(x, y) which can be represented as the sum of independent 

Poisson processes with intensities AK„(x,y) dp,[y). We label the children in 
the nth of these processes by n, and give the initial 'root' vertex label 0. This 
gives a labelling of all particles in the branching process (which starts 




i^{x,y)pl{y)dp{y) = lim / Kn{x,y)pt,^{y) dp{y) 




lim (T«,„p«„)(x) 



n 
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with a single particle of type x) such that the subset of all particles that, 
together with all their ancestors, have labels at most n gives the branching 
process Xk„{x). Consequently (using this coupling of the processes), the 
family tree of the initial particle in will grow to its family tree in 

X^,{x) as n ^ oo. Hence /9>fc(K„;x) /" p>k{K.;x) and p>k{Kn) / p>k{i^)- 

(ii) We may similarly label all particles in (x) with labels {1,2,..., 00} 
such that consists of all particles that, together with their 

ancestors, have labels at least n [00]. By Remark 15.21 a particle always has 
a finite number of children, so a particle survives for eternity if and only if 
it has infinitely many descendants. By Theorem 16.41 we have 

p(Km;2;) \ (6.3) 

for a.e. x. Fix any x for which (|6.3j) holds. Writing |j£K„(ic)| for the total 
population of the branching process Xk„(x), whenever |Xk^(x)| < 00 for 
some ?Ti, we have 

(x)| \ |Xk(x)| as n 00; indeed, for large n the 
entire processes X^„{x) and Xn{x) coincide. From (|6.3|) . with probability 1 
either |Xk(x)| = 00, in which case |j£K„(x)| > |XK(a;)| = 00 for all n, or there 
is an in with |Xk^(x)| < 00, in which case |Xk„(x)| = |Xk(x)| for all large 
enough n. Thus, the events |Xk„(x)| > k converge a.e. to |XK(a;)| > k, and 
p>k{K„;x) \ p>kiK;x). □ 

Suppose that k is supercritical (i.e., that ||Tk|| > 1), and assume for sim- 
plicity that (|5.H) holds for every x. Consider the branching process Xk{x) 
starting with a particle of type x, and classify its children in the first genera- 
tion according to whether they have infinitely many descendants or not. By 
the properties of Poisson processes, this exhibits the children as the union 
of two independent Poisson processes with intensities k{x, y)pK{y) dp{y) and 
K{x,y){l — pi^{y)) dp{y) respectively, where the first litter consists of the 
children with infinitely many descendants, or, equivalently, those whose de- 
scendants live for ever. 

The process ^^(s;) eventually becomes extinct if and only if the first 
litter is empty. It follows that if Xk{x) denotes the branching process 3Ck(x) 
conditioned on extinction, then Xk(x) is itself a multi-type Galton- Watson 
branching process, where the set of children of a particle of type z is given 
by a Poisson process with intensity k{z, y){l — Pniy)) dp{y). This is another 
instance of the situation studied here, with p replaced by p defined by 
dp{y) := (1 — Pniy)) dp{y), except that p is not a probability measure - this 
is unimportant since we can normalize and consider /?' := (1 — p{k))k and 
p' := (1 — p{k))~^p; see Definition 13.151 and the discussion following. 

The process Xk(x) dies out by construction, and is thus subcritical or 
critical. Example 112.41 shows that it can be critical (even when k is irre- 
ducible). In many cases, however, X^ix) is subcritical; we give one simple 
criterion. 

Lemma 6.6. Suppose that k is irreducible and that \\T^^\\ > 1. If g > is 
integrable and such that Tk((1 — Pk)9) > 9 o-e., then g = a.e. 
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Proof. We may assume that holds for every x. By Theorem 15.11 and 

Lemma 15. (^t hi) I (1 — Pk)TkPk < Pk a-e. If 5 > on a set of positive measure, 
then 

/ gp^dp> / g{l - Pf,)T^p^dfi = / PKTn{g{l - Pn)) dp > / p^gdp, 
Js Js Js Js 

a contradiction. □ 

Theorem 6.7. Suppose that n is a quasi-irreducible kernel on {S,p), and 
that ||Tk|| > 1. Let p be the measure defined by dp{y) = (1 — p^{y))dp{y), 
and let be the corresponding integral operator 

T^g ■■= / K{x,y)g{y)dp{y) = T^{{1- p^)g). 
Js 

Then \\f^\\L^(fi) < 1. 

//, in addition, ffg2K{x,y)'^dp{x)dp{y) < 00, then ||Tk||/^2(^) < 1. 

Note that with k' = (1 — p{k))k and p' = (1 — p{k))^^p as above, we have 
= Tj^/, where T^' is defined by T^'g{x) := J^k' {x,y)g{y) dp' (y). Thus 

II^k||l2(^) = ||TK/||i2(JJ/). 

Proof. We may assume (|5.ip and that k is irreducible. The discussion above 
and Theorem 16.11 show that ||rK'||i2(^/) < 1, as Xi^{x) dies out by construc- 
tion. (An analytic proof is easily given too, using a truncation of k. and the 
argument below for the second part.) 

For the second part, the additional assumption implies that 
JJg2K'{x,y)'^ dp'{x) dp'{y) < 00, so ||T^'||_ff5 < 00. Lemma 15 . 1 5 1 shows that 
Tk = Tj?' has an eigenfunction g > with eigenvalue ||Tk||, and thus 

WTnWg = TKg = T^,{{1 - p^)g) a.e. 
If II^kII ^ 1) this contradicts Lemma 16.61 □ 



With a few exceptions, in the rest of the paper we shall not refer directly 
to the lemmas in Section |21 the results in this section describe the properties 
of the branching process we shall use. 

7. Approximation 

Li this section we introduce certain upper and lower approximations to a 
kernel k on a (generalized) ground space (5, p), in preparation for the study 
of the random graph [n, Kn). Recall that 5 is a separable metric space, 
and that p\s a. Borel measure on S with < p{S) < 00. We usually assume 
that p{S) = 1; in this section, this makes no difference. Here the metric and 
topological structure of S will be important. 

Given a sequence of finite partitions Vm = {^mi, ■ ■ ■ , ^mMm}^ m > 1, of 
S and an x £ S, we define imix) by 

X G Am,i^^x)- (7.1) 
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As usual, for ^4 C 5 we write diam(A) for sup{d{x,y) : x,y £ A}, where d 
is the metric on our metric space S. 

Lemma 7.1. Let be a (generalized) ground space. There exists a 

sequence of finite partitions Vm = {Ami, ■ ■ ■ , AmMm}> fn ^ ^7 of S such that 

is measurable and fi{dAmi) = 0; 

(ii) for eachm, Vm+i refines Vm, i-^., each Ami is a union\]-^j^, Am+ij 
for some set Jmi', 

(iii) for a.e. x £ S, diam{Am^i^(x)) as m ^ co, where im{x) is 
defined by (|7.1j) . 

Proof. If 5 = (0, 1] and is continuous, e.g., is the Lebesgue measure, 
we can take Vm as the dyadic partition into intervals of length 2~"^. If 
5 = (0, 1] and ^ is arbitrary, we can do almost the same; we only shift the 
endpoints of the intervals a little when necessary to avoid point masses of 

In general, we can proceed as follows. Let zi, Z2, . . . be a dense sequence 
of points in S. For any Zi, the balls B{zi,r), r > 0, have disjoint boundaries, 
and thus all except at most a countable number of them are ^-continuity 
sets. Consequently, for every m > 1 we may choose balls Bmi = B{zi,rmi) 
that are ^-continuity sets and have radii satisfying 1/m < rmi < 2/m. Then, 
Uj Bmi = S, and if we define B'^^ := Bmi \ [Jj<i ^mj, we obtain for each m 
an infinite partition of 5 into /^-continuity sets, each with diameter 

at most 4/m. To get a finite partition, we choose large enough to ensure 



that, with B'^Q := U,>^„ B'^^, we have fi{B'^o) < 2""^; then {i?;,Jfj^ is a 



partition of S for each m, with diam(i?^j) < 4/m for i > 1. 

Finally, we let Vm consist of all intersections HI^i ^'h with < ii < Nf, 
then conditions |(i)| and |(ii)| are satisfied. Condition |(iii')| follows from the 
Borel-Cantelli Lemma: as ^mf^i-^'mo) finite, a.e. x is in finitely many of 
the sets B'^q. For any such x, if m is large enough then x £ B'^^ for some 
i > 1, so the part of Vm containing x has diameter at most diam(i?^j) < 
4/m. □ 

Recall that a kernel k on (5, ji) is a symmetric measurable function on 
5x5. Fixing a sequence of partitions with the properties described in 
Lemma im we can define sequences of lower and upper approximations to 
K by 



We thus replace k by its infimum or supremum on each Ami x ^mj- As K. 
might be +00, we shall use it only for bounded k. 



I^m{x,y) := m.i{K{x,y') : x' £ Am4.^(x), v' ^ ^m,i^{y)}, 
l^m{x,y) := SUp{K(x',y') : x' £ Am,i^(x), y' e ^m,j„{y)}- 



(7.2) 
(7.3) 



By Lemma 17. ]|[ii)[ 



'm — '^m+1 



and > K^^^. 
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Furthermore, if k is continuous a.e. then, by Lemma 17. ]|tiii)[ 

i^m{x,y) K{x,y) and K+(x,y) K{x,y) for a.e. {x,y) e 5^. (7.4) 

Since /t~ < k, we can obviously construct our random graphs so that 
G{n,K^) C G{n,K); in the sequel we shall assume this. Similarly, we shall 
assume that G{n, k+ ) 5 G{n, k) when k is bounded. 

If (k is a graphical sequence of kernels with limit k, we define instead 

'^■m{x,y) ■■= inf{(K A : x' G ^^^^^(a,.), y' e ^m.i^fe), n > m}. 

By Lemma I7.]|[ii)[ we have < k^+i, and from Lemma I7.]|t iii)| and 
(fmi|) we see that 

K^{x,y) Kix^y) as m ^ oo, for a.e. {x,y) G S^. (7-6) 

Moreover, when n > m we have 

> (7-7) 

and we may assume that G(n, k~ ) C G(n, 

For a uniformly bounded graphical sequence of kernels with limit k, 
we similarly define 

i^mix,y) ■■= sup{(KVK„)(x',y') : x' G y' G ^m.i™(y), n > m} < oo. 

(7.8) 

Relations corresponding to (|7.6j) and (|7.7|) hold for k^; we collect these 
and an additional result in the following lemma. 

Lemma 7.2. Let {nn)nei be a graphical sequence of kernels on a (general- 
ized) vertex space V with limit k, and suppose that sup^, ^ „ k„(x, y) < oo. 
Then there is a sequence k^, m = 1, 2, . . ., of regular finitary kernels on V 
with the following properties. 

(i) We have K^{x,y) \ K{x,y) as m ^ co for a.e. {x,y) G . 

(ii) Whenever n > m we have K^{x,y) > Kn{x,y) for every (x,y) G S^. 

(iii) IIT + II \ IITkII as m ^ oo. 

Proof. Let Vm = {^mi, ■ ■ ■ , ^mMm}^ 771 > 1, be a sequence of partitions with 
the properties described in Lemma lTTI and define k^{x, y) by 1)7. 8|) . (If 



K for all n, this is just ()7.3|) .') Then (ii) holds trivially. By Lemma 17. ]|r iii) 



and (|2.1U|1 . k+ \ k a.e., proving (i) Finally, by dominated convergence 



K+ -TJhs^O- Hence, 

||7;|| < ||T^±|| < ||T«|| + ||T^+ -T«|| < ||r«|| + ||r^+ -T^Whs \ \\Z 
proving 



Kill 



(iii) □ 



We finish this section with a result for lower approximations corresponding 
to Lemma l7.2| but with one additional ingredient: for lower approximations 
to be useful we shall often need them to be quasi-irreducible. 
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Lemma 7.3. // {K.n)nei is a graphical sequence of kernels on a (generalized) 
vertex space V with limit k, there is a sequence k~ , m = 1, 2, . . of regular 
finitary kernels on V with the following properties. 

(i) If K is quasi-irreducible, then so is «;~ for all large m. 

(ii) We have k^{x,y) n{x,y) as m ^ oo for a.e. {x,y) £ . 

(iii) Whenever n > m we have k~ < Kn{x,y) for every {x,y) £ . 

Before turning to the proof, let us note that the conclusions of the lemma 
are obvious for suitably 'nice' kernels k (or sequences for example 

if K is continuous, S is compact and k > 0. Indeed, if we partition S 
into finitely many pieces Si in a suitable way, we may then set k^{x,y) = 
inf{K(x', y') : x' £ Si, y' G Sj} whenever x G Si and y £ Sj. Note also that 
in the application we shall need condition |(iii)| for every {x,y) G S^: while 
changes in a kernel k on a set of measure zero do not affect the branching 
process X^, they can affect the graph G^{n, k). 

Proof of Lemma \7.'J\ We may assume that k > on a set of positive mea- 
sure, as otherwise we may take k~ = for every m and there is nothing to 
prove. We shall construct the sequence in two stages. 

Let Vm = {-A-mii ■ ■ ■ 1 AmMm}i ^ ^ 1, be a sequence of partitions with the 
properties described in Lemma l7.ll If k„ = k for all n, we start with k~ 
defined in (|7.2|) . In general, with a sequence k„, we use instead the definition 

m- _ 

Each is of the regular finitary type treated above, and the have 
two of the properties required for the k~ , namely (ii) and (iii), by (|7.6j) and 
1)7. 7() . respectively. However, (i) may fail, as some k~ may be reducible. 
From now on we shall assume that k is quasi-irreducible, as otherwise we 
may take = k~. In fact, without loss of generality we may assume 
that K is irreducible. Indeed, it suffices to prove this case as, given a quasi- 
irreducible AC, we may then apply the result to the irreducible restriction to 
S' xS', and extend the approximating k~ obtained to 5 x 5 by taking them 
to be zero off S' x S'. We shall thus assume that k is irreducible. 

If K~ = a.e. for every m, then k = a.e. by (|7.6|) . contradicting our 
assumption. We may thus assume that there exists an mo such that k"^ > 
on a set of positive measure. We consider only m > tuq, and assume for 
notational convenience that mo = 1. Thus there exist i and j (possibly 
equal) with iJ,{Aii) , fi{Aij) > and kJ" > on An x Aij. From now on we 
fix such a pair i and j. 

For m > 1, let Em := Ui^mi : K^mi) = 0}, noting that fi{Em) = 0, 
and let Bm be the set of all x G 5 such that for some A; > 1 there exists a 
sequence xq, . . . ,Xk with xq = x, Xk £ An, k~ (x/_i,Xi) > 0, and xi ^ Em 
for I = 1, . . . , k. (Note that x = xq may belong to Em-) Since k~ is constant 
on each Amp x Amq, Bm is a union of some of the sets Amp- It is easily seen 
that Bm Q Bm+i, that Bm 1^ Bi ^ Aij, that the restriction of k~ to Bm 
is irreducible and that = on {Bm \ Em) x (5 \ Bm) and thus a.e. on 

Bm X {S \ Bm)- 
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Let B := U^i?m- If n > m, then Bm ^ Bn and thus = a.e. on 
Bm X {S \ B) CI Bn X (5 \ B„). Letting n ^ oo, (|7.6j) shows that k = a.e. 
on Bm X (S \B). Letting now m ^ oo (taking the union) yields «; = a.e. 
on B X (S\B). Since k is irreducible, it follows by (|2.12j) that /i(-B) = or 
li{S \B) = 0. As 5 D Si D Aij, we have ^(5) > 0, so /x(5 \B) = 0. In 
other words, a.e. x £ B = \J„^Bm- 

Now define 

Thus K" is off Bfn x B^m and the restriction to B^ is by construction 



irreducible and of the regular finitary type, so condition (i) of the lemma is 
satisfied. Furthermore, by (|7.6() and the fact that Bm B with fi{S\B) = 0, 
we have km{x,y) K(a;,y) as m ^ oo for a.e. {x,y) S 5^, so (ii) holds. 



Finally, if n > m, then km < < so (iii) holds. □ 



8. The number of edges 

In this section we consider circumstances in which the condition (|2.9() or 
1)2.11(1 on the convergence of the number of edges in G^(n, k) does, or does 
not, hold. In doing so, we shall make frequent use of the approximating 
kernels k~ and «;+ defined for a single kernel k by (|7.2|) and (|7.3|) . and for 
a sequence by ()7.5|) and (|7.8() . As before, we shall always write the (gener- 
alized) vertex space V under consideration as (5,/i, (x^)), unless otherwise 
specified. 

Lemma 8.1. Let k be an a.e. continuous kernel on a (generalized) vertex 
space V. Then 

limmf ^Ee{G^ {n, k)) > I jj K{x,y) dfi{x) dfi{y). (8.1) 
If K, is a bounded a.e. continuous kernel on a vertex space V, then 

lim-Ee{G^{n,K)) = I I I K{x,y) dn{x) dfi{y). (8.2) 

Proof. Write G{n,K) for G^{n,K). Consider first the regular finitary case 
defined in Definition 14. 41 For n > max k, conditioning on ni, . . . , we have 



1 1 1 

-Ee(G{n,K) | rii, . . . , n,.) = — {niUj - ni6ij)-K{i,j) 

V K{i,j)fj.{Si)fi{Sj) = \ 1 1 n{x,y)dfi{x)dn{y). (8.3) 



p 1 

2 



Taking expectations and applying Fatou's Lemma, it follows that (|8.1() holds 
in this case. 
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In general, to prove (|8.1j) we use Lemma [TTI and the approximation (|7.2|) . 
For every m, by the case just treated, 

hminf — EefGfn, k)) > hminf — E e(G(n, k~ )) = h [ [ k~. 

As m — > oo, the monotone convergence theorem imphes thatJJ k~ ~^ JJ 
and (|H?T|) follows. 

If V is a vertex space and k is regular finitary, then the left-hand side of 
H8.3() is bounded by maxK/2, so by the dominated convergence theorem we 
have y^Ke[G{n, k)) ^ JJ In general, if V is a vertex space and k is 
bounded, we can use k+ in place of At~ to show that lim sup^^^^^ - E e(G(n, k)) 
<i//K. □ 

Remark 8.2. Condition ()8.2() may fail for a generalized vertex space V, even 
if K is constant. The problem is that the definition of a generalized vertex 
space only imposes 'whp conditions' on the number of vertices, giving no 
control on the distribution in the o(l) probability case that these conditions 
fail, and hence giving no control on expectations. In particular, with k iden- 
tically 1, the expected number of edges is essentially ^ E(|'\/(G^(n, k))^), 
and we have no control over this expectation - it can even be infinite. 

When the number Vn of vertices is sufficiently concentrated (for example 
Poisson), this problem does not arise. Indeed, H8.2|) holds whenever k is 

bounded and Var(t>„/n) —>■ 0; since Vn/n ^ by assumption, the vari- 
ance condition is easily shown to be equivalent to E(t;„/n)^ —>■ and to 
imply uniform integrability of (fn/n)^; see e.g. |61l Proposition 4.12]. (If the 
parameter n is not restricted to integers, we may have to consider a sequence 
of indices n.) Since the left-hand side of (|8.3() is bounded by maxK (u„/n)^, 
which is also uniformly integrable, we may take the expectation in ()8.3() and 
obtain (jHUl). 

Our main results concern statements that hold whp, and convergence in 
probability of various quantities. For such statements, a small chance of a 
very large number of vertices is not a problem. 

The following lemma shows that the condition to be graphical is essen- 
tially equivalent to a statement about approximations with bounded kernels. 

Lemma 8.3. Let k be a bounded a.e. continuous kernel on a (generalized) 
vertex space V. 

If V is a vertex space, then k is graphical if and only if 

(i) for every e > there exists an M < oo such that 

lim sup - Ee(G^(n, k) \ in, k A M)) < e. 
n 

In general, k is graphical if and only if (i) holds together with 

(ii) for every M < oo, 

lim sup — E e(G^(n, k A M)) <hJJ '^(^' ^l^i^) dfi{y). 
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Proof. It is obvious that (i) and (ii) imply that lim sup - 

^ 7/52 n{x,y) dfi{x) dfj,{y), which together with Lemma |H3 shows that k is 

graphical. 

Conversely, suppose that k is graphical on V. Then, from the definition 
of graphicality (see ()2.9|) ). Ee(G^(n, k)) — > 5// k. Applying Lemma IHH to 
K A M, it follows that 

limsup-Ee(G^(n,K)\G^(n,KAM)) 
n 



K-liminf iEe(G^(n,K AM)) <\ j f {k - k A M) < 



e 



if M is large enough. □ 



Note that (ii) almost always holds by Lemma l8. II and Remark 18. 21 Argu- 



ing as in the proof of Lemma l8.31 one can show that (ii) can be replaced by 



the condition that each k A M be graphical; we omit the details. 

Remark 8.4. Lemma l8 . 31 implies that, if k is a graphical kernel on a (gen- 
eralized) vertex space V and < c < 00, then cn is also graphical on 



V. Indeed, it suffices to check condition (iii) of Definition 12.71 namely that 
^Ke[G{n,cK)) ^5// CK{x,y). Without the min{-, 1} in the formula (|2.8|) . 
this would be immediate from the same condition for k; indeed, the claim 
that CK is graphical is equivalent to the claim that replacing this 1 with 1/c 
does not affect Ee(G(n,K)) by more than o(n). 



Since ck A cM = c{k A M), it is obvious that condition (ii) of Lemma 
holds for CK if and only if it holds for k. Moreover, if n > M, 

^E(e(G"^(n,fi;)\G^(n,KAM)) | x^) = n~'^^[K{xi,Xj)An-K{xi,Xj)AM). 

i<j 

It is clear that if we replace k by ck and M by cM, and assume n > 2(1V c)M, 



then this sum changes by at most a constant factor. Hence condition (i) of 
Lemma 18.31 also holds for ck if and only if it holds for k. 

Lemmas 18.11 and 18.31 hold also for the variants of G^{n,K) defined in 
Remark 12.41 by the same proofs. Moreover, it is easily seen that conditions 
(i) and (ii) of Lemma 18.31 hold for one of these versions if and only if they 
hold for G^{n, k). Hence k is graphical if and only if the analogue of (|2.9|) 
for one of these variants holds. 

The results above can be extended to sequences (k„) satisfying (|2.1fl|l . 
using the approximations k~ defined by (|7.5j) . In particular, a similar argu- 
ment shows that if ( graphical sequence of kernels on a (generalized) 
vertex space V with limit k, and (c„) is a sequence of positive reals with 
c„ — > c > 0, then {cnKn) is graphical on V with limit ck. 

Let us emphasize that relation (|8.2|) . i.e., condition (|2.9|) from the defi- 
nition of graphicality, often holds for unbounded k too, and for generalized 
vertex spaces V. One example is when the Xi are random as in Example 14. 8| 
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another is the Poisson process case in Example 14.91 A rather different ex- 
ample is the following. 

Example 8.5. Suppose that S = (0,1], fj, is the Lebesgue measure and 
Xi = i/n; this vertex space was considered in Example 14.71 and will be used 
in several further examples in Section 1161 with several different kernels. 
Suppose that k is decreasing in each variable, so K{x,y) > k{x' ,y') when 
X < x' and y < y'. Then 

-Ee(G(n,K)) = ^^K(i/n, j/n) An < - / / K{x,y)dxdy. 

Hence (|8.1() implies that 1)8.2(1 holds in this case. Note that this includes 
both (tTTni and (ITOl) . 

We next give a simple example where (|2.9|) . i.e., ()8.2() . fails. 

Example 8.6. Take again S = (0,1], let /i be the Lebesgue measure and 
set Xi = i/n. Let < 5 < 1 be constant and define k by 



K{x,y) 



m if x Ay = 1/m and m > 1; 
S otherwise. 



Note that k = 5 a.e., and hence p{k) = p{5) = as for G(n, 6/n) in Exam- 
ple furthermore, k is continuous a.e. 

Now, K{l/n, j /n) = n for every j < n. Hence, G(n, k) contains the 
star consisting of all edges Ij, 1 < j < n, so G(n, k) is connected and 
Ci{G{n, k)) = n, although, as remarked above, p{k) = 0. Consequently, 
(j|-{.2j) fails in this case. Note that all assumptions of Theorem . 1 1 ar e satisfied 
except (|2.9j) : indeed, e(G(n, k)) >n — 1 while JJg2 K{x,y) dfi{x) dp{y) =5. 

We can modify this example to make k continuous on (0,1]^: for < 
e < 1/4, let Ke{x,y) = <j){x A y) with (j){l/m) = m, 4){l/m =b em~'^) = 6, 
and 4> linear in between. If e is small enough, then ||Tk^|| < 1 (because 
the Hilbert-Schmidt norm satisfies ||T'k^||//s' — > 5 as e ^ by dominated 
convergence); thus p{ks) = 0, although Ci(G(n, k^)) = n. 

We next give a result on the number of edges conditioned on ; this time 
we consider a sequence (k„) of kernels. 

Lemma 8.7. Let (k„) be a graphical sequence of kernels on a (generalized) 
vertex space V with limit k. Then 

^E(^e{G^{n,Kn)) j j ^K{x,y) dp{x) dp{y). 
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Proof. Let W„ := E(e(G^(n, \ii.n) /n andw := \ jjg2 K'{x,y) dp{x) dp{y). 
By our assumption ()2.11|) . we have E Wn w. 
Define k~ by (|7.5|) . By (|8.H|) . applied to k~ , 

Wjr^ :=E(e(G^(n,K-)) | x„)/n^u;^ := i j j ^^K;^{x,y) dp{x) dp{y). 
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Let e > be given. By (|7.(i|) and monotone convergence, Wm w as 
m — > oo, so we may choose m such that Wm > w — e. For n > m we have 
Wn > Wn^^ , and hence 

F{Wn <w-2e) < F{wi"'^ < - e) ^ as n — > oo. 

Hence, writing /_ for —(/AO), we have (Wn — w)^ and, by dominated 
convergence, E(l^„ — 0. Consequently, E|l^„ — u)| = 2K{Wn — w)- + 
E(Wn -w)^0. □ 

Remark 8.8. Recahing (|2.2j) or (|2.4j) . the convergence condition for the 
empirical distribution f„ of the types of the vertices in a (generalized) vertex 
space, we have Vn ^ and Wn w (in the notation of the proof above), 
where Vn and Wn are functions of x„. Coupling the x„ for different n 
appropriately (a simple application of the Skorohod coupling theorem |611 
Theorem 4.30]), or considering appropriate subsequences, we may assume 
that Vn A* and Wn — > w a.s. Consequently, we may condition on x„, and 
assume that (|2.2() and 1)2. llj) still hold. In other words, after conditioning on 
x„, V is still a (generalized) vertex space and («;„) is still graphical with limit 
K. By conditioning in this way we may thus assume that x„ is deterministic; 
see Subsection 18.11 

Our next result shows that the number of edges is concentrated, so that 
the actual number converges as well as its mean. 

Proposition 8.9. Let (k„) he a graphical sequence of kernels on a (gener- 
alized) vertex space V with limit k. Then 

^e(G"'^(n, Kn)) ^ \ j j^^ k(x, y) dfj,{x) dn{y). 

Proof. Let G„ = G^{n,Kn) and, as above, Wn = E(e(G„) | x„)/n. Condi- 
tioned on Xn, the number e{Gn) of edges is a sum of independent Be{pij) 
variables, and thus Var(e(G„) | x„) < E(e(G„) | x„). Hence, using ()2.1H) . 

E(e(G„)/n -Wn)^ = n'^ E(Var(e(G„) | x„)) < n-2E(e(G„)) ^ 0. 

Consequently, e(G„)/n — W„ ^ 0, and the result follows by Lemma IHTTl □ 

Finally, we note that small sets of vertices do not connect to too many 
edges. For this we need a simple lemma. 

Lemma 8.10. Let k be a bounded kernel on a (generalized) vertex space V, 
and let Gn = G^{n, k). Then ^gS'^) ~ 0{n) whp. 

Proof. Let a := 2fi{S) and b := sup K{x,y) < oo. Then whp Vn < an, and 
thus, in the natural coupling, G^(n,K) C G{[an\,b/n) whp. Consequently, 
it suffices to prove the result for G{[an\,b/n) or, changing the notation 
slightly, for G{n,c/n) for every fixed c > 0. However, for any graph G, 
Y^^daii) = 2e(G) and c?G(0(f^G(«) - 1) = 2P2(G), twice the number of 
paths of length 2 (cf. Section [T7|l . It is well-known, and easy to prove, that 
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e{G{n,c/n))/n ^ a\ and P2{G{n,c/n)) /n 02 for some constants 01,02 
(depending on c), see e.g. 59, Chapter 3 and Theorem 6.5]. Consequently, 
with C := 2ai + 2a2 + 1, Yli '^G'(n c/n)(*) ^ whp. (Alternatively, we may 
use Theorem ll7.1l ) □ 

Proposition 8.11. Let (k„) he a graphical sequence of kernels on a (gener- 
alized) vertex space V with limit k. Given e > 0, there is a 5 > so that whp 
the sum of the degrees of any set of at most 5n vertices of Gn = G^ {n, Kn) 
is at most en. In particular, any set of Op{n) vertices of Gn has Op{n) 
neighbours. 

Proof. Let dciv) denote the degree of vertex v in a graph G, and let k~ 
be as in Lemma [7.31 Since k~ y k a.e., // k~ JJ k, and thus we can 
choose m such that //k~> JJ k — e. Let G'n := G^{n,k-). For n > m 
we have k~ < k„, and we may as usual assume that G'^ C G„. Moreover, 
Proposition 18.91 applies to both G„ and G'^, so 

^ E {dGAi)-dG'M = -e(G^(n,K„)\G^(n,K-)) 



Hence, whp 

E fe(i)-rfGa^)) <2en. (8.4) 

i6V(G„) 

By Lemma [8. 101 applied to G'^, there is a constant C < 00 such that whp 
X^i ^G" (^) ^ Hence, if 5 = e^/C, the Cauchy-Schwarz inequality shows 
that whp for every set A C V{Gn) with \A\ < 5n, 

Y^dc'Ji) < < {\A\CnY/' < en. 

Combining this with 1)8. 4|) . we obtain X]iGA^Gn(^) < 3en, whp for all such 
A, and the result follows by replacing e by e/3. □ 

8.1. Generalized vertex spaces. Our main results concern graphical se- 
quences of kernels on generalized vertex spaces, expressing properties of the 
graphs G^{n,Kn) in terms of the limiting kernel k. As noted earlier, it is 
intuitively clear that we lose no generality by restricting our attention to 
vertex spaces. Furthermore, as noted in Remark 18.81 we may assume that 
the vertex types x„ are deterministic. As we shall now see. Lemma 18.71 and 
a simple probabilistic lemma given in the appendix imply precise forms of 
these assertions. We start by showing that we may take the sequences x„ 
to be deterministic. 

Let Kn be a graphical sequence of kernels on a generalized vertex space 
V with limit k. As noted in Remark 18.81 by coupling appropriately we may 
assume that, after conditioning on (x„), the triple (5,/n, (x„)), in which the 
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sequences x„ are now deterministic, is (a.s.) still a generalized vertex space, 
and that k„ is (a.s.) graphical on this space with limit k. Almost all our 
results assert that (given some e > 0) a certain event En holds whp; recall 
that statements like Xn ^ a and X„ = Op(o„) can be expressed in this 
form. The x„ deterministic case of such a result then implies that (a.s.) 
the conditional probabilities P(i?n I x„) tend to 1. Taking expectation with 
respect to the random sequences x^, it follows by dominated convergence 
that f{Sn) Ij i-e., the result holds also for random x„. 

Some of our results are of the form Xn = 0(a„) whp. Again, it suffices 
to prove such a result for deterministic x„; the general case then follows 
by Lemma lA.51 with 1^ = f„, and Ain the set of all measures of the form 
Y^i 6xi, a subset of the metric space Ai of all finite Borel measures on 
S. The key point is that i/„ determines x„ up to relabelling the vertices, and 
that the conditional distribution of the unlabelled graph G^(n, k„) given x„ 
does not depend on the labelling, or on x„/, n' ^ n. 

We now turn to the simple reduction from generalized vertex spaces to 
vertex spaces. Although the arguments apply to all our main results, for 
definiteness, we shall illustrate them with one particular example: we shall 



show that statement (iii) of Theorem 13.11 namely 

-Ci(G^(n,K„)) ^p(a^), (8.5) 
n 

follows from the same statement restricted to the case that V is a vertex 
space. 

Let V = (5,/x, (x„)„g/) be a generalized vertex space, and let k„, n £ I, 
be a graphical sequence of kernels on V with limit k. As noted in Section |2l 
purely formal manipulations show that taking /x(5) = 1 loses no generality, 
although one must be a little careful with the introduction of normalizing 
factors. To spell this out pedantically, let I' = /i(5)/ = {fi{S)n : n G /}, let 
/i' = fj,/fj,{S) be the normalized version of the measure fi, and let V' be the 
generalized vertex space {S,iJ,', {ym)mer) defined by = x^/^(5)) so the 
sequences (x„) and (ym) are identical except for our rescaling of the index 
set. Writing k' for h{S)k and for /^(5)Km//^(5), for n G / the graphs 
G^{n, K„) and G^' {m, n'^), m = fj,{S)n, have exactly the same distribution. 
Also (as a consequence), the sequence k'^ is graphical on V' with limit k' , so 
our main results, in particular Theorem 13. 11 apply to the model (m, k^). 

Multiplying At by the constant factor /i(5) and dividing /i by the same 
factor leaves the branching process <I>k, and hence the survival probability 
p{k;x), unchanged, and so divides p{k) by a factor n{S). Multiplying the 
index variable n by /x(5) divides the left-hand side of 1)8. 5() by the same 
factor, so this relation for G^{n, tin) follows from the same relation for the 
model G^ (m, k^). 

Apart from the rather trivial normalization above, there are two further 
differences between vertex spaces and generalized vertex spaces. One is that, 
in the former, the index set is discrete, indeed a subset of the integers. This 
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makes very little difference: for any result of the form f{Gn) ^ a, n £ I, 
it suffices to consider 'thin' index sets I, say discrete sets {ii,i2, ■ ■ ■ ,} with 

h > 100 and it+i > 2it. Indeed, if f{Gn) a fails, there is an e > and 
an unbounded set I' C I with P(|/(G„) — a\ > e) > e for every n £ I', and 
then f{Gn) a fails along any subsequence of /', and hence along at least 
one thin sequence. Thus, in all our main results we need only consider 'thin' 
index sets. 

The final extension allowed by generalized vertex spaces is a that the 
number of vertices in x„ may be random, rather than exactly n. As noted 
at the start of the section, we may assume that each x„ is deterministic, 
and in particular that the number Vn of vertices is deterministic. This does 
not quite give a vertex space, as we need not have Vn = n: instead, taking 
A = S va 1)2. 4|) . we have Vn/n — > 1. Rescaling the indexing parameter 
as above, replacing n by Vn (after taking a subsequence if necessary) and 
multiplying k„ by a factor f^/n, does not affect the distribution of the graph, 
so the resulting kernels are still graphical with limit k. Hence, our results 
for vertex spaces apply. In particular, using (|<S.5j) for vertex spaces, we find 
that 

^Ci{G''{n,Kn))^p{K). 

As n ^ Vn, this implies 1)8. 5|) . 

We have shown that it suffices to prove 1)8. 5() . i.e., part (iii) of Theorem l3.11 
for vertex spaces in which the sequences x„ are deterministic; this was our 
aim in this subsection. Similar comments apply to all our results. 



9. The giant component 

In this section we prove our main results. Theorems 13.11 and 13.61 of Sub- 
section concerning the existence, size and uniqueness of the giant com- 
ponent in the random graph G^(n, The basic strategy will be to relate 
the neighbourhoods of a vertex of G^(n, k„) to the branching process, by 
exploring these neighbourhoods step by step. In the context of random 
graphs, this step-by-step exploration and comparison with a branching pro- 
cess, which now is standard, was perhaps first used by Karp [HI], who applied 
it to study the size of the giant component in random directed graphs; sim- 
ilar ideas were used earlier in other contexts, for example by Kendall |,63] in 
the study of epidemics. 

Let us first recall some notation. We shall work with the branching process 
Xk defined in Subsection 12.11 and studied in Sections |S1 and [HI As before, 
when the branching process is started with a single particle of type x we 
denote it X^{x). Unless explicitly stated otherwise, k will be a kernel on 
a vertex space V = (x„)„>i); most of the time we shall not consider 

generalized vertex spaces. We shall assume that k G L^, i.e., that // k < oo. 
Any additional assumptions on k (such as irreducibility) will be stated. 
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Recall that p>k{i^',x) is the probability that Xk{x) contains at least k 
particles in total (in all generations taken together), and pk{K;x) is the 
probability that 3Ck{x) contains exactly k particles in total, while p{k;x) 
is the probability that Xk(x) survives for eternity, i.e., for infinitely many 
generations. Starting the process with a particle of random type with dis- 
tribution /X, the corresponding probabilities for are p>k{K), pki^), and 
p{k). 

A key step in our proofs will be an additional result, relating the fixed-size 
components of G^{n, Kn) to the branching process X^. As before, we write 
Nf^{G) for the number of vertices of a graph G in components of order k, 
and A^>fc(G) for X]j>fe ^ji^)? number of vertices in components of order 
at least k. 

Theorem 9.1. Let (k;„) be a graphical sequence of kernels on a vertex space 
V with limit k. If k > 1 is fixed, then Nk{G^{n, Kn)) jn Pk{n)- 

Remark 9.2. In |25| I84j . results similar to Theorem 13.11 were proved (for 
special k but with more complicated dependencies) using a careful coupling 
of the discovery process of the random graph and the limiting branching 
process; here we shall do this coupling only in the simple case of finitely many 
types (Example 14. 3|) : the general case will then follow by approximation 
and monotonicity arguments. In particular, we shall show that any Gn = 
G^{n, k) contains a G'^ = G^ {n, k'), where k' may be regarded as a kernel 
defined on a finite set S, such that Ci{G'^) is no more than Op{n) smaller than 
Ci(G'n); a formal statement is given below. This reduces many questions 
concerning the very general model G^{n, k) to the much simpler 'finite-type' 
case. 

Proposition 9.3. Let (Kn) be a graphical sequence of kernels on a vertex 
space V with quasi-irreducible limit k. Given any e > 0, there is a ver- 
tex space V' = {S',p', (yn)n>i) with S' finite and a quasi-irreducible kernel 
k' on S' X S' with the following properties: p{n') > p{n) — e, the graphs 
Gn = G^{n,Kn) cind G'^ = G^'{n,K') can be coupled so that G'^ C Gn for 
sufficiently large n, and Gi{G',^)/n p{k'). 

The assertion concerning Ci(G^) will follow from the other assertions and 
Theorem 13.11 However, we shall prove Proposition 19.31 as a step towards 
the proof of Theorem 13.11 This is an example where quasi-irreducibility is 
forced on us: if we assume k is irreducible, we still cannot insist that k' is 
irreducible. 

We now turn to the proofs. We start by giving two elementary results 
that will be useful below. The first concerns N>kiG), the number of vertices 
of a graph G that are in components of order at least k. Note that for any 
graph G and any k > 1, 

Ci(G) <max{A;,iV>fc(G)}, (9.1) 
since if Gi{G) > k then N>k{G) > Gi{G). 
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Lemma 9.4. If k >2 and G, G' are two graphs with G C G' , then 

N>k{G) < N>k{G') < N>k{G) + 2k{e{G') - e{G)). 

Proof. If we add a single edge to G, the set of vertices belonging to compo- 
nents of orders > k will either remain the same or increase by the inclusion 
of one or two smaller components; hence N>k{G) will increase by at most 
2{k — 1). The result follows by iterating e{G') — e{G) times. □ 

Lemma 9.5. As k ^ oo, p>k{i^',x) \ p{k;x) a.e. x, and p>k{K) \ p{k). 

Proof. As K £ L^, (|5.1() holds a.e. x. By Lemma l5.ll we may assume that 
1)5. If) holds for every x. Then every particle in the branching process has 
a finite number of children, so a particle survives for eternity if and only if 
it has infinitely many descendants, and the result follows. □ 

Now we turn to the main part of this section, which concerns the connec- 
tion between the order of the giant component of G^(n, k) and the survival 
probability p{k). 

We begin by studying the case when S is finite. It will turn out that 
this case gives essentially everything, using our monotonicity results and 
Lemma 19.41 We use the notation in Example 14.31 We shall assume that 
we have a fixed k, as in Definition 12.71 rather than a convergent sequence 
Kn as in Definition 12.91 and Theorem 13. 11 In addition, we shall assume that 
the matrix k is irreducible and that p{{i}) > for every i. As observed by 
Soderberg [SS], we then can adapt the standard branching process argument 
for the classical random graph G{n,c/n), see, e.g., [SHI Section 5.2]. The 
details are as follows. 

Lemma 9.6. Let k be a kernel on the vertex space V = (5, ^u, (x„)„>i), 
where S = {1,2, .. . ,r}, and suppose that p{{i}) > for every i. Writing 
Gn for G^{n,K), if k is irreducible we have 

Ci{Gn)/n^ p{k). (9.2) 

Whether or not k is irreducible, for any fixed k we have 

N>k{Gn)/n^ p>k{K). (9.3) 

Proof. Recall that we have rij vertices of type i, i = 1, . . . , r, and that Ui/n ^ 
Pi = p{i\. Coupling the graphs (or just the x„) for different n appropriately, 
we may of course assume that Ui/n — > pi a.s. From now on we condition 
on Til, . . . , n,-; we may thus assume that ni, . . . ,nr are deterministic with 
Ui/n Pi. 

Let uj{n) be any function such that uj{n) oo and uj{n)/n 0. (Al- 
though it might seem more natural to fix uj{n) = log log n, say, we shall 
need this flexibility in the choice of uj{n) later.) We call a component of 
Gn '■= G^{n,K) big if it has at least uj{n) vertices. Let B be the union of 
the big components, so \B\ = N-^aj{n){Gn)- 
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Fix e > 0. We may assmne that n is so large that uj{n)/n < e^i and 
\ni/n — < e^i for every i; thus (1 — e)fiin < rij < (1 + e)fiin. We may 
also assume that n > maxK, as k is a function on the finite set 5x5. 

Select a vertex and explore its component in the usual way, one vertex at a 
time. We first reveal all edges from the initial vertex, and put all neighbours 
that we find in a list of unexplored vertices; we then choose one of these and 
reveal its entire neighbourhood, and so on. Stop when we have found at 
least u;(n) vertices (so x G B), or when there are no unexplored vertices left 
(so we have found the entire component and x ^ B). 

Consider one step in this exploration, and assume that we are about 
to reveal the neighbourhood of a vertex x of type i. Let us write for 
the number of unused vertices of type j remaining. Note that Uj > n'j > 
rij — w(n), so 

(1 - 2e)fij < n'j/n < (1 + e)fij. (9.4) 

The number of new neighbours of x of type j has a binomial Bi(n^-, K{i,j)/n) 
distribution, and the numbers for different j are independent. The total 
variation distance between a binomial Bi(n,p) distribution and the Poisson 
distribution with the same mean is at most p, see, e.g., the first inequality 
in Barbour, Hoist and Janson (1.23)]. Hence the total variation dis- 
tance between the binomial distribution above and the Poisson distribution 
Po(^K{i,j)n'j/rij is at most K{i,j)/n = 0(l/n). Also, by (|9.4j) . 

{l-2e)K{i,j)fij < K{i,j)n'j/n < {I + e)K{i, j)fj.j . 

Since we perform at most uj{n) steps in the exploration, we may, with an er- 
ror probability of 0{u>{n)/n) = o(l), couple the exploration with two multi- 
type branching processes X(i-2e)K and ^(i+g)^ such that the first process 
always finds at most as many new vertices of each type as the exploration, 
and the second process finds at least as many. Consequently, for a vertex x 
of type i, 

P>..(n) ((1 - 2e)K; i) + 0(1) < F{x eB)< p>^(„) ((1 + e)K- 1) + 0(1). (9.5) 

Note for later (after (|9.8)) ) that, as for any constant C the Poisson distribu- 
tion with mean C has probability o(l/n) of exceeding logn, the probability 
that we find more than log n new neighbours in one step is 0(l/n). It follows 
that the probability that we reach more than u){n) + logn vertices during 
the exploration is o(l). (Informally, we cannot 'overshoot' by more than 
logn.) 

Since oj{n) oo, by Lemma 1^31 we have p>u(n){i^' 'li) p(^';0 every 
kernel k' £ L^, so we can rewrite 1)9. 5() as 

p{{l - 2e)K; i) + o(l) < F{x £ B) < p{{l + e)K; i) + o(l). 

Letting e ^ we find, using Theorem 16.41 that if x is of type i, then the 
probability that the component containing x is big satisfies 



P(x e B)^ p{K;i). 



(9.6) 
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(Recall that we are conditioning on the types of the vertices, treating the 
numbers rii of vertices of type i as deterministic, and assuming that rii/n ^ 
/ij.) Summing over all vertices x we find 

11 \ ^ 

- E |5| = - V P(x G 5) = - V rii P(2; G 5 I X is of type i) 
n n ^-^ n ^-^ 

(9.7) 

Note that this limit is independent of the choice of ijj{n) in the definition of 
B. Hence, if we define B' using another such function uj'{n), it follows from 
1)9 .7(1 (considering u Au' and to V u') that 

E\B AB'\/n^O. (9.8) 

Next, start with two distinct vertices x and y, of types i and j, say, 
and explore their components as above, again stopping each exploration 
if we find uj{n) vertices. Assume for the moment that uj{n) is small, say 
a;(n) = logn. The probability that during the truncated exploration we 
find a connection between the two components is 0{uj{n)'^ /n) + o(l) = o(l). 
(Here we use the fact noted after (|9.2() . that we are not likely to overshoot: 
with probability 1 — o(l), at every stage, even after stopping the exploration 
of one component because it has become too large, the explored parts of the 
components contain at most ijj{n) +logn vertices.) As before, fix e > 0. For 
n large enough, ignoring the possibility of joining the truncated components 
of X and y, we can couple the two explorations as above with independent 
branching processes (with (1 — 3e)K for the lower bound) to obtain 

P>u{n) ((1 - 3e)K; i) p><^(n) ((1 - 3e)K; j) + o(l) 

< P(x,y G B) < p><^(„)((l + e)K;i)/9>^(„)((l + e)K;j) +o(l). 
Letting e ^ 0, it follows, as above, that 

F{x,yeB)^p{K;i)p{K;jy, (9.9) 
therefore, summing over all pairs of vertices x, y, we find that 

^ E = ^ ^x, y G B) + ^E\B\ ^ Yl P^f^JPi^'^ i)p(^-^ j) = Pi^? ■ 

x^y i,j=l 

Combining this and 1)9. 7() . we see that Var(|i?|/n) 0, and thus that 

\B\/n^p{K). (9.10) 

So far, we have assumed that io{n) was small. However, by (|9.8|) . having 
proved (|9.1U|) for one choice of Lo{n) it follows that 1)9. 1U() holds for every 
choice of uj{n) satisfying Lj{n) — > oo and a;(n) = o(n). 

For any choice of uj{n) with ui{n) oo and ui{n) = o{n), equation 1)9. 1U() 
gives the upper bound on the size Ci(G„) of the largest component claimed 
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in (|9.2j) . since Ci{Gn) < max{u;(n), \B\} by (|9.H1 . In other words, for any 
e > 0, 

|Ci(G„))|/n<p(K)+e (9.11) 

holds whp. 

To obtain the matching lower bound, it remains to show that all but Op(n) 
vertices in B belong to a single component. (We note that this is the only 
place where the irreducibility of k is needed.) We first consider the simpler 
case where > for every i and j; we shall return to the general case 

afterwards. We shall reveal the edges in G„ in two rounds: given < e < 1, 
we may take independent graphs Gn,o and Gn,i on the same vertex set, 
with the distributions of G(n, (1 — e)K) and G{n,eK) respectively, so that 
Gnfl U Gn,i ^ Gn- We shall think of Gn,o as containing almost all the edges 
of Gn, and Gn,i as containing a few edges we initially keep in reserve. 

Recalling that \S\ = r, set w(n) = rn"^/^, and let Bo be the union of the 
big components in Gn,o- From 1)9. 1U() . applied with (1 — s)k in place of k, 
whp we have 

\Bo\/n>piil-e)K)-e. (9.12) 

We claim that whp all vertices of Bq lie in a single component in G„. To see 
this, we condition on Gn,o and use the random graph Gn,i- let x,y £ Bq be 
vertices in distinct components Gx, Gy of Gn,o- As a; = r-n?^^, there are 1 < 
ijj such that Gx contains a set C* of at least v?^^ vertices of type i, and 
Gy a set Gy of at least n^/^ vertices of type j. Now the probability that 

does not contain a C* — G^ edge is (1 — k(«, j)/n)l*-^^ll*-^^l = exp(— 0(n-'^/^)) = 
o(n^^). As there are at most ii? pairs to consider, it follows that whp all 
vertices of Bq lie in a single component of G„, and hence, from 1)9. 12() . that 

\Gi{Gn)\/n>p{{l-e)K)-e (9.13) 

holds whp. 

The case when some K{i,j) may be zero is only slightly more compli- 
cated. This time, we replace Gn,i by r independent graphs Gn^i with the 
distribution of G{n,eK/r). Given C* and Gy as above, the irreducibility 
of K implies that there is a sequence of types, i = 11,12, ■■■ ,it = j, such 
that K{ii,ii^i) > for all /. As there are only r types, we may suppose 
that t < r + 1 (note that we may have i = j)- Let Ai = C*, and, for 
2 < / < t — 1, let A; be the set of vertices of type i/ adjacent to Ai_i 
in Gn,i-i- As l^il = r2(n^/^) and ^(11,^2) > 0, the expected size of A2 
is r2(n'^/^); furthermore, from a standard Chernoff bound, with probability 
1 — exp(— r2(n^/^)) = 1 — o(n~^) we have \A2\ > E |yl2|/2, say. Iterating, we 
see that for some c > we have |^t-i| > cre^/^ with probability 1 — o(n~^). 
Finally, we find an edge in G{n, t — 1) from At^i to Gy with very high 
probability, as above, establishing (|9.1,3j) in this case as well. 
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Letting e — > and using Theorem 16.41 the right-hand side of (l^.l!-}!) tends 
to p{k), so (|9.13j) proves the lower bound on Ci{Gn) claimed in (|9.2j) . Com- 
bining this with the upper bound (|9.1H1 . equation H9.2jl follows. 

To prove (|9.3j) . observe that if we replace uj{n) by a fixed number k in 
the argument leading to (|9.1fl|) above, and use Theorem 16 . 51 instead of The- 
orem I6.4( we obtain (|9.3|) instead of (|9.10|) . Note that this argument has 
not made use of the irreducibility of k either. □ 

Note that the first part of Lemma l9.6l and Theorem l6 . 2l imDlv Theorem l3.1l 

in the case when S is finite, > for every i £ S, k.^ = k for every n, 
and K is irreducible. 

We next consider the regular finitary case in Definition 14.41 let us recall 
the definition. A kernel k on a vertex space V is regular finitary if S may 
be partitioned into a finite number r of /i-continuity sets Si, . . . ,Sr so that 
K is constant on each Si x Sj. A //-continuity set is a measurable set A <^ S 
with fJ-{dA) = 0. We next prove an extension of Lemma 19.61 to this regular 
finitary case. 

Lemma 9.7. Let k be a regular finitary kernel on a vertex space V, and let 
Gn = G^{n,K). Then holds. If K is irreducible, then (|9.2|) holds. 

Proof. As noted in Example 14. 3[ the regular finitary case differs only in 
notation from the finite case, so it suffices to prove that the conclusions 
of Lemma 19.61 hold without the assumption that each > 0. Due to 

the generality of our model, we cannot just ignore sets of measure zero; see 
Remark 12.31 

Using the notation of Lemma let us say that a type i is had if //j = 0, 
and let S' := {i £ S : i \s not bad}. Conditioning on the sequences rej 
as in the proof of Lemma 19.61 if z is a bad type then ni/n 0. Hence, 
if we eliminate all vertices of bad type, we are left with a random graph 
G'^ = G{n' , (n' /n)K'), where k' is the restriction of k to S' xS' and n'/n — > 1. 
It is easily seen that p{n') = p{n), and p>k{K') = p>k{K). The expected 
degree of any vertex is at most maxK < oo, so the expected number of 
edges with at least one bad endpoint is o{n). Hence, Lemma l9 . 41 shows that 
for each fixed k, E(iV>fc(G„) - iV>fc(G^)) = o{n). Consequently, holds 
for Gn because it holds for G'^. 

Similarly, applying ()9.2() to G^, we see that if e > then Ci{Gn)/n > 
Ci{G'n)/n > p{k) — e whp. In the opposite direction, 1)9. 3|) yields that for 
every e > and k > 1, whp A^>fc(G„)/n < p>k{i^) + e, and 1)9. 1|) implies 
Ci{Gn)/n < p>k{K)+£ whp. Further, by Lemma P3l we have p>k{i^) \ /o(^) 
as k oo. Taking k large enough, we find that Gi{Gn)/n < p{k) + 2e whp, 
and (jni2I) follows. □ 

For technical reasons, we prove a slight extension of Lemma 19.71 to the 
quasi-irreducible regular finitary case; cf. Remark 12.121 
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Lemma 9.8. Let k be a regular finitary kernel on a vertex space V. Suppose 
that K is quasi-irreducible, i.e., that there is a ^-continuity set S' S such 
that K restricted to S' is irreducible and k = off S' x S' . Then (j9.2|) holds 
for Gn = G^{n,K). 

Proof. We may ignore all vertices with types not in 5', since they will be 
isolated, and consider the restriction of our model to S' . Note that we now 
have n' vertices, with n'/n ^(5'). The case = is trivial, and 

otherwise we can consider the normalized measure on S' and the 

kernel k' = fi{S')K on S' x S' . It is easily checked that Lemma 19.71 implies 
that holds for G n in this case as well. □ 

It turns out that most of the work is behind us; roughly speaking, to prove 
Theorem 13.11 we shall approximate with the regular finitary case and use 
Lemma l9.7l There are some complications, as we must ensure irreducibility 
of the approximations, but these have already been dealt with in Section [3 
when graphical sequence of kernels with quasi-irreducible limit k. 

Lemma 17.31 gives us a sequence of quasi-irreducible regular finitary kernels 
K~ approaching k from below. Furthermore, k~ < «;„ when n > m, so we 
may and shall assume that 

G(n,K„) C G(n,K„) (9.14) 

for n > m. This will allow us to apply Lemma 19.41 

We are now in a position to prove our main results. We start with the 
approximation result Proposition 19. 3( which shows that for many purposes 
we need only consider the finite-type case. 

Proof of Provosition \y.'A We use the kernels k~ constructed in Lemma l7.3l 
From Lemma I7.lj[ij ii) and Theorem 16.41 if m is large enough then k~ is 



quasi- irreducible and p{Km) ^ P(^) ~ ^- Fix such an m. We may regard 
the regular finitary kernel kernel on a finite set S', so the graph 

G^ (n, k^) has the required distribution for G'^. From Lemma I7..'^f iii)| and 
.14|) we can couple G'^ and Gn so that G'^ C G„ whenever n> m. Finally, 



from Lemma 19.81 we have Ci{G'^)/n /^(k') as required. □ 

Next, it will be convenient to prove a restatement of Theorem 19.11 

Lemma 9.9. Let (k„) be a graphical sequence of kernels on a vertex space 
V with limit k, and let k > 1 be fixed. Then 

N^k{G^{n,Kn))/n^ p>k{K). (9.15) 

Note that Lemma 19.91 immediatelv implies Theorem 19.11 as Ni^. = N>i^ — 

Proof. As before, to avoid clutter we suppress the dependence on V, writing 
G{n,-) for G^(n,.). We shall also write G„ for G{n,Kn) = G^{n,Kn). We 
may assume that k > 2, since the case A; = 1 is trivial. We use the k~ 
constructed in Lemma EH 
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For each m, by Lemma 19.71 we have 

N>k{G{n,k;^))/n^ p>kik:^). (9.16) 

Let e > 0. Since p>fc(K~) p>k{i^) as m ^ oo by Theorem 16.51 we can 
choose m such that p>k{k^) > p>k{n) — £■ Using (|9.14() . it fohows from 
that whp 

N>k{Gin,Kn))/n > N>k{G{n, k;^)) /n > p>k{K) - (9.17) 

proving the lower bound claimed in (|9.15() . 

To prove the upper bound, consider any ?] > 0. By monotone convergence, 



f^mi^^ y) dp{x) dp{y) k{x, y) dp{x) dp{y) 

52 J Js'^ 

as m ^ 00. Hence we may choose m such that J J ^ ~ // < v/^- We 
now fix this m. 

By ()2.11|) and Lemma l8. II fapplied to the bounded kernel k'^, we have 
E(e(G„)-e(G(n,K„)))/n 

Hence, for n large, 

E(e(G„) - e(G(n, K„)))/n < r?/2fc, (9.18) 

and by (I^TTH) and Lemma lOl E(iV>fc(G„)-iV>fc(G(n, k~ )))/n < r/. Hence, 
for large n, using also 1)9. 16() . 

P(iV>fc(G„)/n > p>fe(K) + 2e) < P(iV>fc(G„)/n > /)>fe(K„) + 2e) 

< ¥{N>k{G{n, k;^))/n > p>fc(K~ ) + e) 

+ P((iV>fc(G„)-iV>fc(G(n,/i7j))/n>e) 

<r] + rj/e. 

Letting 77 — > 0, we find N->k{Gn) < P>k{i'^) + 2e whp, which together with 
H9.17|l completes the proof of the lemma. □ 

As noted above. Theorem 19.11 is just a reformulation of Lemma 19.91 We 
are now ready to prove Theorem 13. 11 

Proof of Theorem V3.1\ As noted in Subsection l8.11 without loss of generality 
we may assume that V is a vertex space, rather than a generalized vertex 
space. As above we write G„ for G^(n, k„), and consider the approximating 
kernels k~ constructed in Lemma 17?^ 

We first observe that (|2.131) . Theorem 16.21 and Lemma 15.^1 i)| imply that 
p{k) < 1, and that p{k) > if and only if WT^W > 1. 

Next we prove the upper bound (|3.1() on the size of the giant component 
of Gn- Fix e > 0. By (|9.1|1 and Lemma l9.9[ for every fixed k > 1, whp 

Ci{Gn)/n < k/n + N>k{Gn)/n <£ + p>k{n) + e. (9.19) 
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By Lemma l9.f)[ as k ^ oo, /9>fc(K) \ p{k). Hence we may choose k so large 
that p>k{K) < p{n) + £, and (|9.19j) yields Ci{Gn)/n < pyki^) + 3e whp, 
proving (|3.1|) . 

For quasi- irreducible k, the lower bound on the size of the giant com- 
ponent claimed in (|.S.2|) follows from Proposition 19.31 Alternatively, we 
may argue as in the proof of Lemma 19.91 Fix e > 0. By Theorem 16. 4| 
Pi'^rri) ~^ Pi'^) as m ^ oo, so we can choose m such that pi^m) > P{^) ~ 
and then, by Lemma 19.81 and (|9.14|) . whp 

Ci(G(n, K„))/n > Ci(G(n, > p{k) - e. 

Together with this proves the convergence claimed in 1)3. 2(1 . 

It remains to prove part (i) of Theorem 13.11 When WT^^W < 1 we have 
p{k) = 0, so (|3.1|) yields Ci(G„) = Op{n), as required. Suppose that ||Tk|| > 
1. Recall that e., by Lemma IT.I-jhijj It follows as in the proof of 

Lemma l5.1()l that \\T~- 11 > 1 if m is large enough. Let us fix such an m. 
As fi;~ is of the regular finitary type, there is a finite partition S = IJi=o 
of S into ;U-continuity sets such that the restriction of k~ to S-i x Si is 
irreducible for 1 < z < r, and k~ is zero a.e. off IJi=i '^i ^ ^i- (This can 
be regarded as an application of Lemma 15.171 with S finite. However, the 
lemma is trivial in this case.) As T-- operates separately on each Si, we 
have llr-- II = maxj ||T„/||, where ||T„/|| is defined either on the generalized 
ground space //I5.), or, equivalently, by extending to 5 x 5 by setting 
K'-{x,y) = li X ^ Si or y ^ Si. In particular, there is an i with ||T^'|| > 1. 
Extending k[ to S x S as above, k[ is a supercritical quasi-irreducible kernel 
on S of the regular finitary type, with k„ > > k[ for large n. Hence, by 
(|9.14() and Lemma 19.81 we have 

Ci(G„)/n > Ci(G(n,K^)/n ^ > 0, 
and Ci{Gn) = 0(n) whp follows, completing the proof. □ 

We next prove Theorem 13.61 showing that the second largest component 
has size Op{n). 

Proof of Theorem AS.fA Let be a graphical sequence of kernels on a (gen- 
eralized) vertex space V with quasi-irreducible limit k, and uj{n) a function 
satisfying uj(n) — > 00 and uj(n) = o{n). Our task is to show that 

N>u,{n){Gn) ■■= ^ Cj{Gn) = np{K) + Op{n). 

j>l: Cj{G„)>uj{n) 

Then (|3.5j) follows by Theorem l3.11 In turn, (|3.6j) follows immediately, taking 
uj{n) = logn, say. As before, we may assume that V is a vertex space. 

Let e > 0. For an upper bound on N-^u:{n){Gn), fix a large k such that 
p>k{i^) < p{i^) + £• For large n we have uj{n) > k and thus by Lemma l9?9l 
whp iV>^(n)(G„)/n < N>k{Gn)/n < p{k) + 2e. 
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For a lower bound, assume that p{k) > 0. Then, by Theorem IIIIL whp 
Ci{Gn) > ^p{K)n > uj{n), so by Theorem 13. II again, whp A^>^^(n)(Gn)/^^ > 
Ci{Gn)/n > p{k) — e. This is trivially true if p{k) = too. 

Since e was arbitrary, the proof is complete. □ 

We now turn to a result giving the distribution of the types of the vertices 
making up the giant component; to state this, we need some more definitions. 

Let Ci{Gn) be the largest component of G„ = G^(n, i.e., the com- 
ponent with most vertices, chosen by any rule if there is a tie. (Thus, if a 
sequence (G„) has a unique giant component, then Ci{Gn) is this giant com- 
ponent.) Let i/^ := ^ X^jgCi(Gn) random measure with total mass 
Ci{Gn)/n that describes the distribution of the points Xj corresponding to 
the vertices in the largest component. We equip the space of finite positive 
Borel measures on S with the weak topology; see Appendix As before, 
we write p^ for the function defined by Pk{x) = p{k.; x). 

Theorem 9.10. Let {ku) be a graphical sequence of kernels on a (gener- 
alized) vertex space V with quasi-irreducible limit k. Then z/^ p^ in the 
space of finite measures on S with the weak topology, where p^ is the mea- 
sure on S defined by dpf^ = p^dp. In other words, for every p-continuity 
set A, 

i^liA) = -#{i G Ci(G„) -.XiGA}^ p^{A) = [ p{K-x)dp{x), (9.20) 
n J A 

where Gn = G^{n,Kn)- Furthermore, if f : S ^M. is continuous p-a.e. and 
satisfies 



n 



then 



iey{G„) 

1 



\f{x^)\ ^ ljf\dp< oo, (9.21) 



n 



S JS 



/(x,)^ / fdp^= / f{x)p{K;x)dp{x). (9.22) 



ieCi(G„) 

In particular, 1)9. 22(1 holds for every bounded and p-a.e. continuous / : 5 — > 
R. 

Condition 1)9. 21() is very natural and often easy to verify; for example, if V 
is a vertex space in which the Xj are i.i.d., as in ExamDle l4.81 or a generalized 
vertex space in which Poisson process, as in Example 14.91 then 

H9.21() holds for every integrable / by the law of large numbers. Similarly, 
if 5 = (0, 1] and Xj = i/n, then 1)9.21(1 holds for every decreasing integrable 
positive /. Note that some restriction on / is needed for H9.22() : it is not 
hard to construct an example where H9.22() fails, and so does (|9.21|) . 

Proof. We begin by proving the first statement. We proceed in several steps, 
as before. Arguing as in Subsection 18.11 we may assume without loss of 
generality that V is a vertex space. 
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First we assume that the conditions of Lemma 19.61 are satisfied: S is 
finite, k is fixed and irreducible, and fii = > for every i. We use the 

notation of the proof of Lemma 19.61 in particular, uj is some function with 
(jj{n) —>■ CO and w(n) = o(n), and B is the set of vertices of G„ = G^{n, k) 
in 'big' components, i.e., components of order at least uj{n). 

Let Vi be the set of vertices of type i. The arguments leading to (|9.10|) 
in the proof of Lemma [9.61 yield also \B Ci Vi\/n ^ p{K;i)fii; see (|9.6() and 

If p{k) > 0, then the conclusion 1)9. 2() of Lemma 19.61 implies that whp 
Ci(G„) C B, and thus and (l9ln|l imply that \B A Ci(G„)|/n ^ 0. 

This is clearly true when p{k) = too, and implies that |Ci(G„) n Vi\/n 
p{K;i)pi for every i, which is exactly ()9.2U|1 . 

The result extends to the case when some pi = as before. Thus 1)9.20(1 
holds in the irreducible regular finitary case considered in Lemma 19.71 pro- 
vided A is one of the sets Si in the partition or a union of such sets. In fact, 
A may be any ^-continuity set, since we may replace the partition {Si} by 
{Si n A, Si\ A}, noting that all parts are /U-continuity sets. Similarly, the 
extension to the quasi-irreducible case is immediate, as in Lemma 19.81 

We now turn to the general case. Note that Pk{S) = p{n). Assume that 
p{k) > 0; otherwise the result is trivial (with p^ = 0) by Theorem K-i. II 

Fix a ^-continuity set A. Use a sequence of partitions Vm as in Lemma fT.ll 
and consider the finitary approximation k~ given by Lemma 17.31 for some 
m. Let be the random measure defined for G{n, k~ ). (As before, we 
suppress the dependence on V.) By the finitary case completed above, 

ui^iA) := E Ci{G{n,k7j) : G A} ^ p^-^{A) (9.23) 

for every fixed m. 

Let e > and choose m so large that p{k^) > p{k) — e and p(k~ ) > 
(see Theorem l6.4j) . Then, applying Theorem 13. II to and ()3.6|) of Theorem 
13.61 to K, whp Ci(G(n, > ^p{k^)n > C2(G„). Recalling the coupling 
(|9.14() . it follows that the largest component of G{n, k~ ) is contained in the 
largest component of G„, i.e., Ci(G(n,K~)) C Ci(G„), and thus ^ ^n- 
Consequently, from (|9.23() . whp 

ul{A) > ul^iA) > p,-{A) - e > p,{A) - 2e, (9.24) 

because p^{A) - Pf,-iA) < p^{S) - Pf,-iS) = p{k) - p{k;;,) < e. 

Since S\A also is a /x-continuity set, we may replace A by S\ A in l|9.24|) 
and obtain that whp 

i^i{S\A)>p,{S\A)-2e. 

Since ^niS) = C'i(G„)/n and Pk{S) = /9(k), this and Theorem 13.11 show that 
whp 

vi{A) = Ci{Gn)/n - vl{S \ A) < p{k) +e- p^{S \A) + 2e = p^{A) + 3e. 
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This and (limi) yield u}^{A) so we have shown that H9.2U() holds 

for this A. We have shown that (|9.2U|) holds for an arbitrary /i-continuity 

set A, which yields /x^ by Lemma lA.21 

Turning to the second part of the lemma, note that the left-hand sides 
of (|9.21|) and (|9.22|) are equal to / |/| dun and / f dv\, respectively. If / is 
bounded and fi-a.e. continuous, Lemma lA. 21 thus shows that these relations 
follow from (|2.4|) and (|9.2fl|) . respectively. 

To deduce (|9.22j) from (|9.21|) for unbounded /, we use the truncations 
/m := (I/I AM) sign(/). Let e > 0. By monotone convergence, / I/a/ | dfi — > 
f \f\ dfi as M ^ oo. Thus, we can choose M such that / I/a/ | dfi > f \ f\ dfi— 
e. Since 1)9. 21() holds for bounded /x-a.e. continuous functions, it holds for 
/a/, so 



- E {\f{xi)\-\fM{xi)\)^ j \f\dfi- j \fM\dfi<e. 
Hence the left-hand side is at most e whp. Consequently, whp 



- y] f{xi) - - V /A/(xi) 
ieCi(G„) ieCi(G„) 



- \f{xi) - fM{xi 



< 

n 

j6Ci(G„) 



E |/(^^)-/m(^.)| = ^ E {\fi^i)\-\fM{xi)\)<e, 
iGy(G„) jey(G„) 

with the first two inequalities holding unconditionally. Note that 

/ fdfi^- / /m dfi^ < \f - JmI dn= I/I dfi- I/a/I dfi < e. 
Js Js Js Js Js 

Since e > is arbitrary, and (|9.22|) holds for each /a/, relation ()9.22() for / 
follows by a standard 3e-argument. □ 

10. Edges in the giant gomponent 

The main aim of this section is to prove Theorem 13.51 which claims that 
if Kn is a graphical sequence of kernels on a (generalized) vertex space V 
with quasi-irreducible limit k, then ^e{Ci{G^ {n, Kn))) ^ C(^)) where C{k) 
is defined in 1)3. 3|) as 

C(k) jj^^nix,y){p{K;x)+p{K;y)-p{K;x)p{K;y)) dfi{x)dfi{y). (10.1) 

Before turning to the proof, we briefly examine the behaviour of C('*)) 
giving two alternative formulae for C('^^)) together with upper and lower 
bounds in terms of p^. 

Prom the symmetry of n and the definition (|2.14|) of T^^, ()10.1|) is equiva- 
lent to 

((i^) = - p^,/2)T^,p^ dp, 
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where, as usual, /o^ is the function defined by Pk{x) := p{k;x). By relation 
?2l) of Theorem 16.21 and the definition of in (12.171). it follows that 



Writing G„ for {n, Hn), note that the assumptions of Theorem 13.51 
include convergence of the expectation of e{Gn)/n. As shown in Proposi- 
tion 18. 9| an easy consequence of these assumptions is that 



P 1 



e{Gn)/n ^2jJs2 ^^"'^^ 

In the light of H10.3() , relation 1)3.4(1 is equivalent to the assertion that number 
of edges not in the giant component is 



(1 - p{k; x))k{x, y){l - p{k; y)) dp{x) dp{y) + Op{n). 
52 



In any connected graph, the number of edges is at least the number of 
vertices minus 1; hence > p{k). In fact. Theorem 16.21 has the following 
simple consequence. 

Proposition 10.1. Let k he a kernel on a (generalized) ground space {S,p). 
Then 

p{^) < ({i^) < + 1)/>(k) < ||T«||p(k). 

Furthermore, the first two inequalities are strict when p{n) > 0. 

Proof. If < s < 1, then s < (1 — s/2) ln(l/(l — s)), as is easily verified by 
computing the Taylor series. Thus, 

,(.;x)<fl-^)lnf ' 



■1 - p{i^;x) 

with strict inequality when p{k; x) > 0. Integrating with respect to p, the 
left-hand side becomes p{k), while, from (|1U.2|) . the right-hand side becomes 
C{i^)- Thus p{k) < C(k)) with strict inequality if p{k) > 0. 

In the other direction, if < s < 1, then (1 — s) ln(l/(l — s)) < s — s^/2, 
as can again be verified by computing the Taylor series. Hence, 

l-^)ln(^)<s-C+'^ln( ' 



.l-sJ 2 2 VI- 

Substituting s = p{k; x) and integrating, it follows that 

< (^p{n;x) - ^piK;xf + ^/)(K;x)ln(^ ^ dp{x), 

with strict inequality when p{k; x) > 0. Writing p^ for the function defined 
by Pk{x) := p{k] x), from (|6.2j) and the definition (|2.17j) of we have 

Ki-p(k;x)) = 
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It follows that 



C{k) < p{k) - ^{PfcPn) + \ {Pk,T^Pk) 




with strict inequality unless = p{k) = 0. 



□ 



Our proof of Theorem 13.51 will be very similar to that of Theorem 13.11 
except that we need to consider certain branching process expectations (t{k) 
and cr>j!c(K) in place of p{k) and p>k{n). In preparation for the proof, we 
shall relate C{k) to the branching process via (t(k). As before, we assume 
that K is a kernel on with k S L^; in particular, it is convenient here 

to normalize so that p{S) = 1. 

Let A he a Poisson process on S, with intensity given by a finite measure 
A, so that A is a random multi-set on S. If is a bounded measurable 
function on multi-sets on 5, it is easy to see that 



(This is a simple consequence of the well-known fact that the Palm distri- 
bution equals the distribution of ^ U {y} . To show (|lfl.4j) directly, note that 
we may construct A as follows: first decide the total number N of points in 
A, according to a Poisson Po(c) distribution with mean c = A(5). Then let 
(^i)i^i be a sequence of i.i.d. random points of S, each distributed according 
to the normalized form A/c of A, and take A = {ai, . . . ,aM}- Let be the 
measure (on finite sequences of points in S) associated to (oj)^^, and let u' 
be the measure with density N dv. Recalling that if Z has a Po(c) distribu- 
tion, then k¥(Z = k) = cP(Z — 1 = k), we find that u' /c may be constructed 
by taking — 1 to have a Po(c) distribution, and then taking the i.i.d. 
as before, or, equivalently, by constructing a sequence according to v and 
appending a new random point with the distribution A/c. Neglecting the 
order of the points, (|10.4|) follows.) 

Let X(x) denote the first generation of the branching process Xi^{x). Thus 
X{x) is given by a Poisson process on S with intensity K{x,y) dfi{y). Sup- 
pose that (|5.1j) holds, so X{x) is finite. Let ct^k; x) denote the expectation 
of |A(x)|l[|j£K(3;)| = oo], recalling that under the assumption (|5.1j) . the 




(10.4) 
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branching process Xk{x) dies out if and only if < oo. Then 

K{x,y)dfi{y) - a{K;x) =E{\X{x)\l[\X^{x)\ < oo]) 

= eI\x{x)\ n i^-pi'^;^)) 

\ zex{x) 



L 



K{x,y){l - p{K;y))E I ]J {1 - p{k; z)) \ dp,{y) 

\z(iX(x) I 

k{x, y){l - p{k; y)){l - p{k; x)) d^i{y). 

Here the penultimate step is from 1)10.4^ : the last step uses the fact that the 
branching process dies out if and only if none of the children of the initial 
particle survives. Writing X for the first generation of X^, let 

o-(k) ■.= E{\X\1[\X^\=oo]) = j a{K;x)dp{x). 
Then, integrating over x and subtracting from JJ K{x,y), we obtain 



o-(k) = J J^^ K{x,y){l - (1 - p{k;x)){1 - p{K-y))) dp{y)dp{x), (10.5) 
i.e., (t(k) = 2C(k), where C,{k) is defined in (|3.3|) . 

Lemma 10.2. Let k he a quasi-irreducible kernel on a ground space {S,p), 
with K € . If (/^n)i° is a sequence of kernels that increase to k a.e., then 
(^{i^n) cr(K) < oo. 

Proof. This is immediate from Theorem I6.4r i) , (|10.5() , the fact that Jj k. < 
oo, and dominated convergence. □ 

As we shall see next, (t(k) is the limit of the expectations 

a>fc(K) :=E(|X|1[|X«| > A;]). 

Lemma 10.3. With k £ fixed, 

'^>kii^) \ (^{f^) as k ^ oo. (10.6) 

Proof We have \X\ > \X\1[\X^\ > k] \ \X\1[\X^,\ = oo]. As E\X\ = 
JJ n{x,y) < oo, the result follows by dominated convergence. □ 

Using the above lemmas we can prove Theorem 13.51 As the argument is 
very similar to that for Theorem 13. H we give only an outline. 

Proof of Theoreni AS.,51 As usual, we may assume without loss of generality 
that V is a vertex space. Let M>k{G) denote the number of edges of a graph 
G that lie in components of order at least k. 

We start with the case when S is finite and n is irreducible, writing G„ 
for G^(n, Kn). Let d{x) denote the degree of a vertex x of G„. Using the 
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local coupling of the neighbourhood of a random vertex x to the branching 
process described in the proof of Leninia l9^ considering ¥,{d(x)l[x £ B]) 
in place of ¥{x S B), the proof of Lemma 19.61 yields the relations 

2e(Ci(G„))/n^a(K) (10.7) 

and 

2M>k{Gn)/n ^ a>fe(^j), (10.8) 

corresponding to H9.2() and 1)9.3(1 . As before, the same formulae in the quasi- 
irreducible regular finitary setting of Lemma 19.81 follow . 

To complete the proof, we consider the approximating kernels k~ con- 
structed in Lemma f7.3l By Lemma l7.c|hi)| and Lemma fl0.2l we have o"(k~ ) 
a{K). Applying (|10.7() to and using the coupling G^{n, k^) C G^{n, Kn), 
n > m, it follows that for any e > 0, 

eiCi{Gn))/n>a{K)/2-e (10.9) 

holds whp. This is exactly the lower bound claimed in (j3.4j) . 
For the upper bound, we claim first that, for each fixed k, 

2M>k{G^{n,Kn))/n ^ a>kiK). 

The argument is exactly as for (|9.15j) . except that in place of ()9.18|) we show 
that there is an m for which E(e(G„) — e{G{n,k^)))/n < 77/(2/?^), and in 
place of Lemma f9.4l we use the fact that, for A: > 1, adding an edge to a 
graph cannot change M>k by more than 2('^2^) -|- 1 < k"^. The rest of the 
proof is as for Theorem 13. H using 

e(Ci(G„))/n < k^/n + M>k{Gn)/n 

in place of ()9.19|) and Lemma I1U.3I instead of Lemma 19.51 □ 

11. Stability 

This section is devoted to the proof of the 'stability' result, Theorem 13. 9| 
which states that deleting a few vertices and their incident edges, and then 
adding or deleting a few edges, does not change the size of the giant com- 
ponent of Gn = G^ {n, Kn) significantly. As usual, without loss of generality 
we may restrict our attention to the case where V is a vertex space; we shall 
return to this later. For the moment, we shall ignore vertex deletion; our 
aim is thus to prove the following special case of Theorem 13.91 

Theorem 11.1. Let {Kn) be a graphical sequence of kernels on a vertex 
space V with irreducible limit k, and let Gn = G^{n,Kn)- For every e > 
there is a 5 > ( depending on k) such that, whp, 

(piK) - e)n < CiiG'n) < {p{k) + e)n (11.1) 

for every graph G'n on V{Gn) = [n] with e{G'n A Gn) < 5n. 
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We shall see later (at the end of Subsection lll.lj) that Theor em ll-} . 91 follows . 
As noted in Subsection 13.21 to prove Theorem 111.11 it suffices to consider 
separately the cases where edges are added and where edges are deleted. 
More precisely, as G'„ n G„ C C G'^ U Gn, it suffices to prove the upper 
bound in (|ll.lj) for G'^ 5 G„, and the lower bound for G'^ C Gn- 

The upper bound is easy. Indeed, by Lemma 19.51 p>k{i^) \ p{i^) as 
k ^ oo. Thus, given e > 0, we may choose k such that p>k{K') < p{i^) +e/3. 
By LemmaEHl whp N>k{Gn) < {p{k) + e/2)n. Taking 5 = e/Ak, it follows 
by Lemma [9.41 that whp N>k{G'n) ^ {pi^^) + ^)^) which implies the upper 
bound in (|ll.lj) . 

For the lower bound, our aim is to show that whp 

Gi{Gn-E)>{p{K)-e)n (11.2) 

for every E C E{Gn) with \E\ < 5n. 

We may assume that p{k) > 0, as otherwise there is nothing to prove. As 
in the proof of Theorem l3.11 it suffices to consider the regular finitary case; in 
fact, given e > 0, by ProDosition l9.31 there is a vertex space V with finite type 
space and a quasi-irreducible kernel k' on V' with p{k.') > p{k) — e/2 such 
that we may consider (n, k') as a subgraph of G„. It suffices to prove that 
there is a 5 > such that removing at most 6n edges from G^ {n, k') leaves 
whp a graph with a component of order at least (pin') — e/2)n. Replacing 
e by 2e, this is exactly (|11.2j) . but with G^{n,Kn) replaced by G^ {n,K'). 
Thus we may assume that Gn = G^{n,K), where k is a quasi-irreducible 
kernel on a finite set 5 = {1, 2, . . . , r}. In fact, by rescaling, as in the proof 
of Lemma l9.81 we may assume that k is irreducible. Finally, as in the proof 
of Lemma 19.71 we may assume that p{i} > for every i, as there are Op(n) 
edges incident with types i with p{i} = 0. In other words, we may assume 
the setting of Lemma 19.61 We shall do so for the rest of this section; thus 
Gn = G^ {n, k), where V = {S, p, (x„)„>i) is a vertex space, and 

5 = {1, 2, . . . , r}, > Vi, k is irreducible, and ||Tk|| > 1. (11-3) 

In a paper studying the bisection width of sparse random graphs, Luczak 
and McDiarmid [SII proved (|11.2|) for the Erdos-Renyi case, where |5| = 1 
or K is constant. Their proof adapts easily to the finite-type case, from 
which, as shown above, Theorem 111.11 follows. We present this proof in 
Subsection 111.11 

A different, perhaps more natural, approach to proving (|11.2|) is to work 
with the branching process , using the coupling of vertex neighbourhoods 
in Gn with jC^ to reduce (|11.2j) to an equivalent statement for the two- 
core. Lemma 111.101 below. The latter statement has a very simple proof 
in the uniform case. We present this approach here, in Subsection 111.21 
below, because the intermediate results, relating properties of the two-core 
to the branching process, are likely to be of interest in their own right. 
Unfortunately, while Lemma 111.101 can be proved in the general case by 
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branching process methods, our proof is rather comphcated. As the result 
follows from Theorem lll.il which can be proved more simply by the method 
of Luczak and McDiarmid, we omit the proof. A reader interested only in 
the proof of Theorem 1 1 1 . 1 1 can safely omit Subsection 111.21 

11.1. Counting cuts in the giant component. In this subsection we 
prove Theorem lll.H and then deduce Theorem l3.91 Apart from the straight- 
forward adaptations to non-constant k, the argument for Theorem 111.11 is 
that of Luczak and McDiarmid jJOj- We start with a deterministic lemma 
whose statement and proof are taken verbatim from [70] . 

Lemma 11.2. For any e > 0, there exist r/o = f?o(e) > and no such that 
the following holds. For all n > hq, and for all connected graphs G with n 
vertices, there are at most (1 -|- e)" bipartitions of G with at most rj^n cross 
edges. 

Proof. Let T be an arbitrary spanning tree of G. Any 2-partition S, S of 
T is determined uniquely by the corresponding set of cross edges, together 
with the specification for each cross edge of which of its endpoints is in S. 
For as T is connected, the cross edges specify a nonempty subset S* of S, 
and then S is the set of vertices v such that there is a path from v to one of 
the vertices in S* where this path does not use any of the cross edges. (If 
V G S then no path from v to S* can avoid the cross edges, and v £ S 
then any shortest path from v to S* avoids the cross edges.) Hence, since T 
has 71 — 1 edges, the number of 2-partitions of T (and hence also of G) with 
at most rjn cross edges is no more than 

j;2^(")=0(n2^-((l-ry)i-V)""), 

j<'r]n 

assuming rj < 1/2. Now let e > 0. As — > 0, 2^ / {{1 — rf)^'''^ rj'^) 1. Hence, 
for T] sufficiently small and n sufficiently large, there are at most (1 -|- e)*^ 
partitions with at most r]n cross edges. □ 

Recall our assumptions (|11.3|) . that 5 = {l,2,...,r}, > for every 
i, K\s irreducible and ||Tk|| > 1. As usual, we condition on x„, so we may 
assume that x„ is deterministic for every n, so there are Ui vertices of type 
i, with Ui/n — > > as ra — > oo. 

The main additional ingredient needed to adapt the proof of jJOl to non- 
constant kernels is the following simple lemma. 

Lemma 11.3. Suppose that the assumptions ()11.3|) hold. For any e > 
there is a 6 = 9{k, e) > with the following property. If n is large enough 
then, whenever V\, V2 CLfc disjoint sets of at least en vertices of Gn = G{n, n) 
such that Vi U V2 contains at least en vertices of each type, the expected 
number of edges from Vi to V2 in Gn is at least On. 

Proof. We assume that n > maxK. Let e' = mm{e/r,e/2}, and let 

9 = {e'fmm{K{i,j) : K{i,j) > 0} > 0. 
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There are types i and j such that Vi contains at least e'n vertices of type i, 
and V2 at least e'n vertices of type j. As k is irreducible, there is a sequence 
i = . . . ,it = j such that K{is,is+i) > for each s. For each ig, from 

our condition on V1UV2, one or both of Vi and V2 must contain at least e'n 
vertices of type ig- It follows that for some s, Vi contains at least e'n vertices 
of type is, and V2 contains at least e'n vertices of type ig+i- But then the 
expected number of edges from Vi to V2 is at least {e'n)'^K{is, is+i)/n > On, 
as required. □ 

Using Lemma lll.3l the proof of Lemma 2 in [201 adapts immediately to 
our setting. Note that we use different notation (in particular, Greek letters) 
from [70 , for consistency with the rest of the present paper. 

Proof of Theorem \ll.l\ As noted at the start of the section, it suffices to 
prove (|11.2|) . assuming that holds. 

Given (5, e > 0, by an {e,6)-cut in a graph G we shall mean a partition 
(W, W) of the vertex set of G with \W\, \W\ > e\G\, such that G contains at 
most 6\G\ edges from W to W. We know from Theorem l3.1l that ^Ci{Gn) ^ 
p{k) > 0, so proving (|11.2|) is equivalent to showing that for any e > there 
is a 5 = 6{e) > such that whp the giant component of Gn has no (e, (5)-cut. 

Given < 7 < 1, let Gi, (jr2 be independent graphs with the distributions 
of G{n, (1 — 7)k) and G{n,^n), respectively. We may and shall couple the 
pair (6*1,^2) with G„ ~ G{n,K) so that G1DG2 C Gn- (The union has 
almost the distribution of Gn', the only difference arises from the possibility 
of Gi and G2 sharing edges.) 

Fix e > 0. Recall that k is supercritical, so p(k) > 0. Furthermore, k is 
irreducible, so p{n\i) > for each i. By Theorem 16.41 for each i we have 
p{{l — 7)k; i) y p{k; i) as 7 — > 0. Let us fix a 7 such that 

pi{l--f)K;i) > {l-e/3)p{K;i) 

holds for every i. Thus, p{{l — 7)^) > (1 — e/3)p{K). 

Following (in this respect) the notation of [ZOl) let U and Ui denote the 
largest components of G„ and Gi respectively, chosen according to any rule 
if there is a tie. Then, by Theorems 13. II and 13. 6| the events 

Ai :={\Ui\ > {l-e/2)p{K)n} 

and 

A2 ■■= {Ui C U and \Ui\ > (1 - e/2)\U\} 

hold whp; for the condition Ui Q U, note that Ui must be contained in some 
component of G„ , and whp only U is large enough. 

Let ei = mm{p{K; i) p{i} : i £ S} /2 > 0. By Theorem I9.1UI the event 

A'l := {Ui contains at least ein vertices of each type i} 

holds whp. Without loss of generality, we may assume that e < ei. Let 
u = "f6{K,ep{K) /2), where 9 is the function appearing in Lemma lll.31 If n 
is large enough then, by Lemma [ll.3[ whenever A'^ holds, if we partition the 
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vertex set of Ui into two parts Vi, V2 each of size at least ep{K)n/2 < ein, 
then the expected number of edges in G2 from Vi to V2 is at least z/n. 

Continuing exactly as in [201, but keeping our notation for the relevant 
constants, let ?? > satisfy 



We claim that A2 Ci C A4. Indeed, suppose that A2 holds and that U 
has an (e, 5)-cut into BuC. Let Bi = B n Ui and Ci = C nUi. Then Ui 
has a partition into BiU Ci, both and \Ci\ are at least 



and the number of cross edges is at most S\U\ < 25\Ui\, so ^4 holds, proving 
the claim. As A2 holds whp, and our aim is to show that P(^3) — > 0, it thus 
suffices to show that ^(^4) 0. 

Let us condition on Gi, assuming that A = AiCiA'^ holds. By Lemma fll.21 
and our choice of 6, there are at most (1 + r/)" (e/2, 25)-cuts of Ui in Gi. 
Consider any one such cut, partitioning Ui into B U C, say. Let X2 be 
the number of edges of G2 from B to C. Recalling that Gi and G2 are 
independent, as noted above, E(X2) > z/n > 4(5n. As X2 has a binomial 
distribution, a standard Chernoff estimate implies that 

F{X2 < 26n) < F{X2 < E{X2)/2) < exp(- E(X2)/8) < exp(-i/n/8). 

As G2 ^ Gn, the probability that B U G is an (e/2, 2(5)-cut of Ui in G„ is 
at most exp(— z/n/8). Hence, conditional on Gi and assuming that A holds, 

F(^4 I Gi) < (1 + ?7)" exp(-i/n/8) < (1 + r/)"(l + 2r/)-" = o(l). 

As the estimate above holds uniformly for all Gi such that A holds, it 
follows that P(A4 \ A) = o{l). As A holds whp, this shows that F^A^) 0, 
as required. □ 

As noted earlier, it is easy to deduce Theorem 13.91 from Theorem 111.11 
Recall that the only differences between these results are that in Theorem l3.9l 
we allow V to be a generalized vertex space, and we allow the deletion of 
vertices as well as the addition and deletion of edges. 

Proof of Theorem AS.fA We first show that, as usual, we lose no generality by 
assuming that V is a vertex space. Although this is not obvious at first sight, 
the general arguments in Subsection 18.11 apply. Indeed, the only potential 
problem arises when we condition on the sequences (x„,), since 6 might 
depend on (x„). However, fixing e and defining Xn as the smallest number 
of changes (edge/ vertex deletions or edge additions) that can be made to Gn 
to obtain a graph G'^ for which (|3.7|1 fails, then Theorem 13.91 states exactly 




riQ{rj) (from Lemma lll.2p . Let 



e\U\-{\U\-m)>em/2, 
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that, for any e > 0, we have n/Xn = 0(1) whp. As noted in Subsection l8.lL 
in proving that any function of Gn is 0(1) whp, we may assume that the 
sequences (x„) are deterministic, by conditioning and applying Lemma lA.51 
From now on we assume that V is a vertex space. Turning to vertex dele- 
tion, given an e > 0, let J > be such that the conclusion of Theorem lll.il 
holds. By Proposition 18.111 there is a 5' > such that the event £ that 
any 5'n vertices of Gn are incident with at most 5n/2 edges holds whp. Set 
5" = min{5',(5/2}. 

Let G'^ be any graph obtained from G„ by deleting at most 5"n < d'n 
vertices, and then adding and deleting at most 5"n < 6n/2 edges. If £ holds, 
then replacing the deleted vertices as isolated vertices to obtain a graph G'^ 
on V{Gn), we have 

\E{G'^) A E{Gn)\ < Sn/2 + 6n/2 = 6n. 

Hence, by Theorem 1 11.11 whp every such G'^ satisfies (|11.1|) . which is exactly 
(|3.7j) . This completes the proof of Theorem 13.91 □ 

The above proof of Theorem 1 11. II is much simpler than any proof we have 
been able to find based directly on branching process methods. However, the 
branching process approach does give additional insight into the relationship 
between the giant component and two-core of Gn and the branching process 

11.2. Branching process analysis of the two-core. Throughout this 
subsection we work with a kernel k on a vertex space V satisfying the as- 
sumptions (|11.3|) . As usual, we assume without loss of generality that the 
number rii of vertices of each type i is deterministic, with rii/n — > /^({i}) as 
n — > oo. The cornerstone of the branching process approach is the following 
form of the coupling between the neighbourhood exploration process in G„ 
and the branching process jC^. 

Lemma 11.4. There is a function Lq = LQ{n) oo such that we may 
couple the neighbourhood exploration process of a random vertex v of Gn = 
G^{n,K) with the branching process so that whp they agree for the first 
Lq generations. 

The sense of agreement is that there is a bijection between the vertices of 
Gn at distance at most Lq from v and the first Lq generations of mapping 
V to the initial particle and preserving type and adjacency, where particles 
in the branching process are adjacent if one is a child of the other. 

Proof. The argument is the same as the proof of (|9.5|) . except for the error 
bounds. Note that it suffices to consider the case Lq fixed. With Lq fixed, 
the total number of vertices encountered has bounded expectation, so we 
may abandon the coupling if we reach more than logn vertices, say, in the 
neighbourhood exploration. At every step, the number of unused vertices of 
type j is fi{j}n + o{n). Using this estimate in place of (|9.4|) . we may couple 
the number of new neighbours of each type found with a corresponding 
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Fo{K{i, random variable so as to agree with probability 1 — o(l). 

As the expected total number of steps is 0(1), the total error probability is 

0(1). □ 

If we have fJ,{j}n + 0(1) vertices of type j, Lemma |ll.4l holds for any 
Lq = o(logn). 

As in the proof of Lemma 19. 6( the coupling easily extends to the Lq- 
neighb our hoods of two vertices. Given G„, let v and w be chosen indepen- 
dently and uniformly at random from the vertices of G„. 

Lemma 11.5. There is an Lo(^) such that we uiay couple (O-^, 

with two independent copies X^, of the branching process so that whp 
the first Lq neighbourhoods of v and of w agree with the first Lq generations 
of Xk and of X'^, respectively. 

We omit the proof, noting only that for Lq fixed, the probability that v 
and w are within graph distance 2Lq is o(l). 

The next step is to find a way of applying the coupling results above 
to expectations of functions of the neighbourhoods. This will require some 
care, due to the possible large contribution to an expectation from the low 
probability event that the coupling fails. 

We consider functions f{v,G) defined on a pair {v,G), where G is a graph 
in which each vertex has a type from S = {1, 2, . . . , r}, and is a distin- 
guished vertex of G, the root. We call such a function an L -neighbourhood 
function if it is invariant under type preserving rooted-graph isomorphisms 
and depends only on the subgraph of G induced by vertices within a fixed 
distance L oiv. We define f{X^) by evaluating / on the branching process in 
the natural way: form a graph from the branching process as above, and take 
the initial particle as the root. Thus Lemma ll 1 .4l imDlies that we can couple 
{Gn,v) with so that f{v,Gn) = f{Xn) whp for every L-neighbourhood 
function /. 

Given an L-neighbourhood function /, let 

Sn:=- V f[v,Gn). 

n ^-^ 

V&V(Gr^) 

Also, for V and w independent random vertices of G„, let Xn = f{v,Gn) 
and Yn = f{w, G„). Note that E(5„) = E(X„) = E(y„). 

Theorem 11.6. Let V be a vertex space with finite type space S, and let k be 
a kernel onV. If f is an L-neighbourhood function such that sup„E(X^) < 
oo, then Sn ^ E(/(Xk)). 

Proof. Let X = f{X^). By Lemma lll.4l we may couple X„, and X^ so that 
P(X„ / X) 0, and hence Xn X. Since sup„ lE(X^) < oo implies that 
the variables Xn are uniformly integrable, it follows that 



E(Sn) = E(X„) ^ E(X); 



(11.4) 
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see |HH Lemma 4.11], for example. 

Let Y be an independent copy of X. From Lemma 111.51 we may couple 
(X„,y„) with {X,Y) so that P((X„,y„) / {X,Y)) 0. In particular, 

XnYn ^ XY. As 

mXnYnf) = E(X^y„2) < y/E(X4)E(y„4) = ^iX^) < c, 

for some C < oo, the variables XnYn are also uniformly integrable, so 
E(X„y„) ^ E{XY). But E(52) = E(X„y„) by hnearity of expectation, 
while X and Y are independent and have the same distribution. Thus 
E{Sl) E{XY) = E(X)2. Together with (tTOl) . this proves the result. □ 

Remark 11.7. Theorem II 1 .61 can be applied to any L-neighbourhood func- 
tion / bounded by a polynomial of the number of vertices within distance L 
of V. Indeed, the number of vertices at distance t from v in Gn is stochasti- 
cally dominated by the number Nt of particles in generation f of a Galton- 
Watson branching process in which the number of children of each particle 
has a Bi(n, max n/n) distribution. As a Bi(l,p) distribution is stochastically 
dominated by a Po(l, — log(l —p)) distribution, if n is large enough then Nt 
is dominated by Nl, the number of particles in generation t of the single-type 
Poisson branching process X2maxK- The probability generating function of 
Nl is obtained by iterating that of the Poisson distribution t times. As all 
moments of a Poisson distribution are finite, it follows that all moments of 
N[ are finite, so the fourth moment of any power of Nl is finite. 

Proposition 18 . 1 11 states that, given e > 0, there is a (5 > such that whp 
any set of at most 6n vertices of Gn is incident with at most en edges. A 
very special case of Theorem 111.61 gives an alternative proof of this result 
(under the more restrictive assumptions of the present section). Indeed, 
writing X for the number of particles in the first generation of X^, since 
EX is finite, we have E{X1[X > M]) ^ as M ^ cx). Given e > there 
is thus an M for which E(X1[X > M]) < e/3. Writing dc{v) for the degree 
of a vertex v in the graph G, let f{v, G) = dG{v)l[dG{v) > M]; clearly, / is 
a l-neighbourhood function. By Remark lll.7( Theorem 111.61 applies to /, 
so 

Sn = - y d(v) ^ E(X1[X > Ml) < e/3. 
n ^-^ 

veV(G„):d{v)>M 

Hence Sn < e/2 whp. Set 5 = e/(2M), and let W he a set of at most 5n 
vertices of G„ . Then 

^ d{w) < d{w)+M\W\ 

we.W w&W ■.d(w)>M 

< nSn + en/2 < en 

whenever Sn < e/2 holds, so whp any set of at most /3n vertices of G„ are 
incident with at most en edges. 

Our next result is a simple observation concerning short cycles. As before, 
we assume throughout that H11.3() holds. 
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Lemma 11.8. Let L = L{n) = o(logn). The probability that a random 
vertex v of Gn is within distance L of a cycle of length at most L is o(l). 

Proof. As all edge probabilities are bounded by p = maxK/n, the expected 
number of vertices v at distance d > from a cycle of length / > 3 is at most 
^i+dpi+d ^ (max k)'^'^. Summing over l,d < L, the expectation is o(n). □ 

The two-core C'^{G) of a graph G is the maximal subgraph of G with 
minimum degree at least 2. Equivalently, C'^{G) consists of those vertices 
and edges of G that lie in some cycle in G, or on a path joining two vertex- 
disjoint cycles. We shall work with the two-core G^ := G'^{Gn) of G„. To 
do so, we need to relate certain properties of G^ to the branching process 
Xk- In the light of Lemma 111.41 it will be useful to have a reasonably 
accurate (o(l) error probability) 'local' characterization of when a vertex 
V is in the two-core. We shall need similar results for vertices not in the 
two-core, but connected to it by short paths. Note that Theorem 13.61 gives 
us a corresponding characterization for the giant component: for a suitable 
L(n) oo, up to an error probability of o(l), a vertex v is in the giant 
component if and only if it is in a component of size at least L, and using 
Lemma lll.4( it is easy to check that whp when this condition holds the 
L-distance set, the set of vertices at graph distance exactly L from v, is 
non-empty, so there is a path of length L starting at v. (We omit the details 
as we use this statement only to motivate what follows, not in the proof.) 
For the two-core, we need two vertex-disjoint paths. 

Let L = L{n) be a function tending to infinity slowly, to be chosen below. 
For a vertex v of G„ and an integer d >0, let TC(i{v) be the event that v is 
at graph distance at most d from the two-core G^ of Gn- Thus TCq{v) is the 
event v £ G'^. Let LTGd{v) be the 'localized' event that there is a vertex w 
at distance d' < d from v joined by two vertex-disjoint paths of length L to 
vertices at distance d' + L from v. Thus, as we explore the neighbourhoods 
of V successively, LTCd{v) is the event that after d' < d steps we reach a 
vertex w (which we expect to be the closest vertex of the two-core to v) with 
two neighbours in the next generation each of which has neighbours for at 
least d — 1 further generations. 

Lemma 11.9. Let d>0 be fixed, and let v be a random vertex of Gn- Pro- 
vided L(n) tends to infinity sufficiently slowly, the event TGd{v) A LTCd{v) , 
i.e., the event that one ofTCd{v) and LTGd{v) holds but not the other, has 
probability o(l). 

Proof. Assume, as we may, that L = o(logn). We start with the case d = 0. 
Let us say that a cycle is short if it has length at most 2L. By Lemma lll.8| 
the probability that v is within distance L of a short cycle is o(l). If TCq{v) 
holds, i.e., v is in the two-core, then is in a cycle, or on a path joining 
two vertex-disjoint cycles. Assuming that v is not close to a short cycle, in 
either case we can find two vertex-disjoint paths of length L starting from 
V, so LTCq{v) holds. Hence ¥{TCq{v) \ LTGq{v)) = o(l). 
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The reverse bound is more difficult, as what we need is an equivalent 
for the two-core of Theorem 13.61 which states that almost all vertices in 
largish components are in a single giant component. In fact, we can use 
Theorem 13.61 Suppose that LTCq{v) \ TCo{v) holds. Note that v is not 
in a cycle by definition of the two-core. Let wi, W2 be two neighbours of v 
joined by vertex-disjoint paths to vertices xi, X2 at distance L from v. In 
Gn — V, there is no path from wi to W2', otherwise, there would be a cycle 
in Gn containing v. Hence, at least one of wi and ^2, let us say wi, is not 
in the giant component of Gn — v. (Here, by the giant component we mean 
the largest component, chosen according to any fixed rule if there is a tie.) 
But wi is in a component of size at least L, as witnessed by the path wixi. 
In summary, if LTCq{v) \ TCo{v) holds, so does the event E{v) that v is 
adjacent in Gn to a vertex w in an intermediate component of G„ — v, i.e., 
a component other than the largest having size at least L. As the random 
vertex v is chosen independently of Gn, the graph G„ — ?; is an (n — 1)- 
vertex graph to which Theorem 13.61 applies . Hence, taking uj{n) = L{n), by 
Theorem l3.6l the number of vertices w of G„ — t; in intermediate components 
is Op{n). Conditioning on Gn — v tells us nothing about the edges from v to 
Gn — V. As K is bounded, it follows that E{v) has probability o(l). Thus 
F{LTGq{v) \ TGo{v)) = o(l), completing the proof in the case d = 0. 

The general case follows using Proposition 18.111 If TCdiv) \ LTCd^v) 
holds, then v is within distance d -\- L of a vertex on a short cycle. Hence, 
by Lemma Hm F(TCd{v) \ LTCd{v)) = o(l). If LTGd{v) \ TCd{v) holds, 
then V is within distance d of a vertex v' for which LTCq{v')\TCq{v') holds. 
By the case d = above, Op{n) vertices v' have this property, and the result 
follows by applying Proposition 18 . 1 11 d times. □ 

We now turn to the branching process equivalents of the events TCd 
and LTCd- Considering the branching process 36^ (started with a single 
particle of random type), let DSd be the event that there is a particle in 
some generation d' < d which has at least two children with descendants 
in all future generations. Similarly, let LDSd^i be the event that there is a 
particle x in generation up to d, say in generation d', such that x has two 
children each of which has one or more descendants in generation d' + L, 
i.e., L generations after x. Note that LDSd^i depends only on the first 
d + L generations of the branching process. Suppose that L{n) grows slowly 
enough that Lemma ll 1 .41 applies with 2L in place of L. Then for any fixed 
d we have d + L < 2L for large enough n, and, with v a random vertex of 
Gn as before, from Lemma 1 11. 41 we have 

F{LTCd{v))=F{LDSd,Lin)) + o{l) as n ^ oo. (11.5) 

Note that for d and L fixed, the event LDSd.i, which is defined in terms of 
the branching process, does not depend on n, so W{LDSd,L) is a constant. 
For each d, as L increases the events LDSd,L decrease to the event DSd- 
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Hence, 

lim FiLDSd,L) = nDSd). (11.6) 

L— >oo 

Suppose now that L{n) tends to infinity sufficiently slowly that (|11.5|) and 
Lemma imi hold. Then, from (fTTHIl . F{LDSd,L{n)) ^ ^{DSd) as n ^ oo. 
Hence, from (m3]) . W{LTCd{v)) = ¥{DSd)+o{l). Finally, using LemmaHHi 
we obtain 

nTCd{v)) = nDSd)+o{i). 

Considering two random vertices f , w of G„ and using Lemma fl 1 . 5l instead 
of Lemmaim we obtain ¥{TCd{v) n TCd{w)) = ^{DSdf + o(l) similarly. 

Writing TCd for the set of vertices of Gn for which TCd{v) holds, i.e., 
for the set of vertices within distance d of the two-core, it follows that 
K\TCd\/n r{DSd) and E{\TCd\/nf (F{DSd)f, and thus 

\TCd\/n^F{DSd). (11.7) 

As IITkII > 1, the branching process 36^ is supercritical, so F{DSo) > 0. 
Hence, taking d = in (jll.Tf) . 

\V{G^)\/n^F{DSo)>0. (11.8) 

The reason for considering the two-core of Gn is that Theorem EIHl boils 
down to a statement about G^. Roughly speaking, the largest component 
of Gn consists of the two-core with some trees hanging off it, and it is easy 
to see what effect deleting edges from the trees has on the size of the largest 
component. The question is what happens when edges are deleted from the 
two-core. 

Lemma 11.10. Suppose that ^ll.'S\i holds, i.e., 5 = {1,2,... ,r}, fj.{i} > 
for each i, k is irreducible, and ||Tk|| > 1. Let G^ be the two-core ofG^{n, k). 
For any e > there is a 6 > such that the following statement holds whp: 
for any set W C V{G'^) with \ W\ > en and \ V{G'^)\W\ > en there are more 
than 6n edges of G^ joining W to V{G'^) \ W. 

In other words, under the assumptions of Lemma l9.6( if ||Tk|| > 1 then 
the two-core G^ cannot be cut into two large (size G(n)) pieces by a small 
set of edges. Note that the two-core itself is large by (jll.8|) . As the proof of 
Lemma lll.lOl is rather long, we first show that it implies Theorem 13.91 

Deduction of Theorem ],'^. fA from Lemma \ll.l(A We have already shown (at 
the end of Subsection 111.11) that, using Proposition 18.111 Theorem 13.91 can 
be deduced from Theorem 111.11 As noted at the start of the section, in 
proving Theorem 111.11 we may assume that (|11.3|) holds, and it suffices to 
prove (|11.2|) . From now on, let us fix the quantity e > appearing in (|11.2|1 . 

The events DSd form an increasing sequence, and their union is contained 
in the event S that the branching process survives (contains points in 
all generations). Also, S \ \J^DSd is the event that the process survives, 
but with only a single infinite line of descent. From basic properties of 
Poisson processes, starting from a particle of type x, the types its surviving 
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children, i.e., its children that have descendants in all later generations, form 
a Poisson process on S with intensity k{x, y)p{K; y) dfi(y). In particular, the 
number of such children is Poisson with some mean A(x) > 0. It follows 
that, conditional on a particle surviving, the probability that it has at least 
two surviving children is positive, and hence, as the type space S is finite, 
bounded way from zero. Hence F{S \ DSd) = 0, so F{DSd) / P(5') = 
/>(«:), and there is a constant D such that P(Z)5'£)) > /9(k) — e/3. From 
(|11.7|) . for any fixed D the set TCd of vertices within distance D of the 
two-core has size P(Z)S£i)n. + Op(n), so whp 

|rCi5| > (p(K)-e/2)n. (11.9) 

Let e' < P(D5o)/3 be a small positive constant to be chosen later, and 
let (5 = min{e', (5(e')}, where (^(•) is the function appearing in Lemma Ill.lUI 
Let us delete an arbitrary set E of at most bn edges from G„, leaving a 
graph Let C be the largest remaining connected part of . 
We claim that 

|F(G2)\T/(G2-)| <e'n 

holds whp. Note that \V{G'^)\ > 3e'n whp by (fTTHI) . If this inequality holds 
and every component of G'^ \ E has size at most — e'n, then there 

is a union H of components of G'^ \ E with between e'n and — e'n 

vertices: indeed, if the largest component has at least e'n vertices, this will 
do as H. Otherwise, every component has at most e'n vertices, and the 
smallest union H with at least e'n vertices will do. The existence of an H 
with the stated properties has probability o(l) by Lemma ll 1.101 proving the 
claim. 

Let X be the component of GJ^ containing G^~ . If u G TCd \ X then, as 
V € TC D , there is a path in Gn of length at most D from t; to a vertex of G^ . 
Taking any such path, as v ^ X, either the path ends in a vertex of G^\G'^~ , 
of which there are whp at most e'n, or it contains an edge of E, and hence 
contains an endvertex of such an edge; there are at most 26n < 2e'n such 
endvertices. In particular, whp all v G TCd \ X are within distance D (in 
Gn) of some set of at most 3e'n vertices. Applying ProDosition l8.11I Z) times, 
it follows that if we choose e' small enough, then whp \TCd \ X\ < en/2. 
Using nil. 9(1 it follows that whp 

C^{G'n)>\X\>{p{^,)-e)n, 

completing the proof of Theorem 13.91 □ 

It remains only to prove Lemma lll.lUl The uniform case (k constant) 
has a simple proof, presented below. 

Proof of Lem,ma \ll.l(A uniform case. In a moment we shall restrict to the 
uniform case; for now, we assume (|11.3|) . 

As ||Tk|| > 1, the branching process is supercritical, so p(k) > 0. From 
irreducibility, it follows that p{n]i) > for every i. Consider the event TS 
that the initial particle has exactly three children that survive. As the initial 
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particle has positive probability of having exactly three children, ¥(TS) > 0. 
Arguing as for (|11.8)) . one can show that the number of vertices of degree 
exactly 3 in is ¥(T S)n -\- Op{n); in fact, both statements are special cases 
of Lemma 111. 1 II below. 

We shall condition on the vertex set and (labelled) degree sequence of 
G^. In other words, we shall condition on the sequence d = {d{l), . . . , d{n)), 
where d{i) is the degree in G^ of the vertex i and d{i) = if i ^ V{G'^). Let us 
write for = \{i : d{i) > 0}|, m2 for e(G^) = ^ ^d{i), and n>3 for 

the number of i for which d{i) > 3. Note that whp n>3 > ¥{TS)n/2 > 0. 
Also, e{Gn) = 0{n) whp (for example, by Proposition 18. 9() . and G„ has 
maximum degree O(logn) whp. Thus there are positive constants ei and 
C, depending only on k, such that 

n>3 > Ein, 1712 ^ Cn2, and maxd{i) < Clogn, (11.10) 

~ i 

hold whp. 

From now on we consider the uniform case, where k is constant, or, 
equivalently, \S\ = 1. This is just the usual Erdos-Renyi random graph 
Gn = G{n, c/n), with c > 1. We condition on d, assuming, as we may, that 
the conditions 1)11.10(1 hold. In the uniform case it is easy to see that (given 
d) the graph G^ is uniformly distributed among all graphs with degree se- 
quence d. This is because any graph can be decomposed into its two-core 
and a collection of vertex-disjoint trees, each sharing at most one vertex 
with the two-core. Hence, any two graphs Hi, H2 with degree sequence d 
can be extended in exactly the same ways to graphs H[, H2 with vertex 
set [n] so that H[ has two-core Hi. As corresponding graphs H[, H2 have 
the same number of edges, they are equally likely in the model G{n,c/n). 
Summing over the possible extensions. Hi and H2 are equally likely to arise 
as G^. 

Let H be the random multigraph with degree sequence d generated by the 
configuration model of In other words, for each vertex i we take d{i) 
'stubs', and we pair the 2m2 stubs randomly, with all (2m2 — 1)!! pairings 
equally likely. For every pair in this pairing, we take an edge between 
the corresponding vertices. This generates a multigraph H with degree 
sequence d, where H may contain loops and multiple edges. Let Psimpie be 
the probability that H is simple. From our assumptions on d it is easy to 
check that Psimpie = exp(— o(n)). This very crude lower bound is all that 
we shall need; the much stronger bound exp(— 0((logn)^)) follows from 
(|11.10|) and the general results in McKay [ZSl- In fact, using the fact that 
^jd(i)^ = 0{n) whp, one can show that Psimpio = ©(I)- Given that H 
is simple, it is uniformly distributed among all simple graphs with degree 
sequence d, i.e., H has the distribution of G^. Hence, to show that G^ has 
a certain property whp it suffices to show that H has the property with 
probability 1 - o(psimpie)_ 

For W C V{G^) let W = V{G^) \ W. Let S{W) denote the set of 
stubs associated to W, so |5(VF)| = '^nzy/dii)- For a set S of stubs, let 
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S = S(y{G'^)) \ S, and let PcutiS,S) denote the probability that, in the 
random pairing, every stub in S is paired with another stub in S. If V{G'^) 
has a partition W, W with at most 5n edges between W and W, then there 
is an 5 C S{W) with \ 51 < 5n so that every stub in S is paired with 

another stub in S. Hence, the expected number of such partitions with W, 
W large is at most 

EC:= Yl E PcutiS,S), (11.11) 

WcV(G2):\W\,\W\>en SCS{W):\S{W)\S\<5n 

and it suffices to show that EC is o(psimpic) if we choose 6 small enough. 
Now Pcut{S, S") = if |5| is odd, and otherwise 

P^-^^^^^>- (2^2 - 1)!! -Vl^|/2j ■ ^^^-^^^ 

Every vertex of C^ has degree at least 2, so for any W we have |S'(Ty)|/2 > 
\W\. As, from (|11.1U() . there are at least ein vertices of degree 3 in G^, 
either \S{W)\/2^ \W\ + ein/4 or \S{W)\/2 > \W\ + em/A. It follows that 
whenever \W\, \W\ > en we have 

where a > is a constant depending only on ei and e, and hence only on k 
and e. Choosing 5 small enough, it follows that 



whenever S C S(W) with \S\ > \S{W)\ — 6n. Given w = \W\, there are 
at most choices for W. Given W, there are, crudely, at most m2(^^) 
choic es fo^ S C S{W) with \S{W) \ S\ < 6n. Hence, from ^TTU} . (tTTfel) 
and (UnU), 



£n<w<n2—£n 



Choosing 6 small enough, it follows that EC = 0{ii? exp(— 2an)), and hence 
that EC < exp(— an) for n large enough. As Psimpie = exp(— o(n)), this 
completes the proof in the uniform (|5| = 1) case. □ 

One might hope that any argument for the uniform case would adapt 
easily to the finite-type case. However, we have been unable to find a simple 
extension of the argument above. Our branching-process based argument 
for the general case is somewhat involved and rather lengthy, and we shall 
not present it. This is because adapting a proof due to Luczak and McDi- 
armid ^| gives the much simpler proof of Theorem 1 11. II in Subsection II 1.11 
which in turn immediately implies the general case of Lemma lll.lfll We 
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believe, however, that the results in this subsection are likely to be useful 
for determining other properties of the two-core. 

We close this section with a final result, Lemma 111.111 below, relating 
any 'local' property of the two-core to the branching process X^- This will 
require a little introduction. 

Recall that, by Lemma lll.4| if L — > oo sufficiently slowly then we may 
couple the 2L-neighbourhood of a random vertex v of G„ with the first 2L 
generations of the branching process so that they agree with probability 
1 — o(l). By Lemma lll.9| for almost every vertex v, v is in the two-core 
if and only if there are two disjoint paths of length L starting at v. This 
allows us to adapt the coupling, and hence Theorem 1 11.6( to the two-core. 

For V £ G'^ and i > 0, let Ff (f , G^) be the set of vertices of at graph 
distance t from v, and set Tt{v,G'^) = if u ^ G^. Note that any vertex 
on a path joining two vertices of G^ is in G^, so rt{v,G'^) is the t-distance 
set of V in the graph G^. Let be obtained from X^ in two steps: first, 
delete any particle that does not have descendants in all future generations. 
Then, if the initial particle has only one remaining child, delete everything; 
we write X^ = in this case. We obtain a certain branching process X^ 
having the following properties whenever X^ ^ 0: the first particle has at 
least two children, and every later particle at least one child. 

For constant D, ii L = L{n) oo then up to an o(l) error probability, 
the first D + L generations of X^ determine the first D generations of X^: 
consider surviving to generation D + L instead of surviving forever. Let v be 
a random vertex of G„. If L ^ oo sufficiently slowly then, by Lemma 111.91 
and Proposition 18.11) (applied D times), the probability that v is within 
distance of a vertex w for which one of TCq{w) and LTCq{w) holds but not 
the other is o(l). Using Lemma 111. 41 it follows that the G^-neighbourhoods 
(Fj(f,G^))^o of ^ random vertex v G Gn can be coupled with the first 
D generations of X^ so as to agree with probability 1 — o(l). Similarly, 
Lemma 111.51 implies its equivalent for G^ and X«;. Using these two results, 
an analogue of Theorem 111.61 follows. In the result below we take /(X^) to 
be zero when X^ is empty. The proof follows exactly that of Theorem II l.(i| 
so we omit it. 

Lemma 11.11. Let D he fixed, and let f = f{v,G) be a D -neighbourhood 
function bounded by a polynomial of the number of vertices within distance 
D of V. Then 

Sn:=- Yl fiv,G^)^EifiX^)). 

Of course, the condition on / could be replaced by a fourth-moment condi- 
tion as in Theorem II 1.61 As an immediate consequence of Lemma lll.lH we 
can describe, for example, the typed degree sequence of G^, taking G) 
to be the 1-neighbourhood function taking the value 1 when v has degree 
d and type i, and otherwise. Lemma Fl 1 . 1 1 1 can be used as the basis of a 
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proof of Lemma ril.lOL but as noted above, the details are rather involved; 
see the first version of this paper, at 
http : //arXiv . org/math . PR/0504589vl . 

Remark 11.12. Recently, Riordan j85j proved an analogue of Lemma lll.lll 

for the /c-core, again using a direct coupling of the neighbourhood explo- 
ration process with a branching process. 

12. Bounds on the small components 

In this section we prove Theorem 13.121 i.e., that the sizes of the small 
components of G„ = G^(n,K) are O(logn) whp under certain assumptions. 
As before, by the giant component in a graph G„ we mean the unique largest 
component, provided it has B(n) vertices - all other components are small. 
Thus, in the supercritical case (HTkH > 1), where there is a giant component, 
a small component is any component other than the largest, so our aim is to 
prove an upper bound on C2(G„). In the strictly subcritical case (WT^^W < 1), 
all components are small, and our aim is to show that Ci(G„) = O(logn) 
whp. 

We shall prove three results that together imply Theorem 13.121 namely 
Theorems 112. 5| [12.61 and 112.71 below. In the supercritical case, we shall also 
prove a more general result, Theorem ll2.1l below. describing the distribution 
of the graph formed from G„ by deleting the giant (more precisely, largest) 
component, Ci(G„). This description is in terms of another instance of our 
general model, involving the dual kernel k defined in Definition 13.151 

Theorem 12.1. Let (Kn) be a graphical sequence of kernels on a (gener- 
alized) vertex space V with quasi-irreducible limit k, with \\Tk\\ > 1. Let 
Gn = G^{n,Kn), and let G'^ be the graph obtained from Gn by deleting all 
vertices in the largest component Ci(G„). There is a generalized vertex space 
V = (yn)) with /2 given by dfi{x) = (1 — p{k; x)) dfj,{x) , such that G'^ 

and G^{n,K,n) can be coupled to agree whp. Furthermore, the sequence (k^) 
is graphical on V with quasi-irreducible limit n. 

If we wish, we can renormalize so that (5, fi) becomes a ground space; see 
the comment after Definition 13.151 However, the resulting graph still has a 
random number of vertices, so we cannot insist that V is a vertex space. 

Theorem 112. II is the natural generalization to our context of the old 'du- 
ality result' of Bollobas JHI for the Erdos-Renyi model G{n,c/n) that was 
the basis of the study of the phase transition there (see also Luczak [7T| . 
Janson, Knuth, Luczak and Pittel jSH]) and the books |16| I59j). 

Remark 12.2. We know that the random graph G^(n, k„) in Theorem 1 12. II 
cannot be supercritical, since otherwise Gn would have a second giant com- 
ponent. It may be critical, see Example 112.41 but is typically subcritical; 
one sufficient condition for subcriticality is given in the next result. 
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Theorem 12.3. Under the assumptions of Theorem \12.1\. if, in addition, 
ffg2 i^iXjU)"^ dfi{x) dfi{y) < oo, then G^{n,Kn) is subcritical. 

Proof. As usual, we may normalize so that = 1. The result then follows 
immediately from Theorem 16.71 □ 

Example 12.4. As in ExampleimS let 5 = {1, 2, 3, ... } with i^{k} = 2~^, 
and let xi, . . . , x„ be i.i.d. random points in S with distribution ^u. Let {£k)T 
be a sequence of positive numbers tending to zero, to be chosen below. Set 
k(1,1) = 4, K{k,k) = 2^ ioT k > 2 (instead of 2'=+^ in Example Ell), 
k) = K{k, 1) = Ek for k > 2, and K{i,j) = otherwise. 

Furthermore, again as in Example 14.121 let Hi^ be the subgraph of G(n, k) 
induced by the ~ Bi(n, 2~^) vertices of type k. Then, conditional on n^, 
each Hk has the distribution of the Erdos-Renyi graph G^Uf^, K{k, k)/n). As 
before. Hi is supercritical, which implies that G(n, k) is supercritical. 

Let A;„ — > oo slowly, and choose such that e^^ < n~^; to be specific, 
set kn := [log2logn] and Sk := exp(— 2'^+^). Then (|4.4|) implies that H^^ is 
a critical Erdos-Renyi graph: the edge probability is not exactly one over 
the number n^^ of vertices, but is (1 + o{n^ ' ))/nk^, which is within the 
"scaling window". It follows, as in Example 14.121 that whp C2(G(n, k)) > 

Ci{Hk„) = ep{nll')>n^/yiogn. 

It is easy to see, analytically from Theorem 16.21 or probabilistically from 
Theorem 19.101 that p{k) as k ^ oo. Consider the graph G^(n,K„) in 
Theorem 11 2. 11 the norm of the corresponding integral operator on L'^{'fl) 
is at least the norm when restricted to {k}, which is exactly 1 — p{k), k >2. 
Hence the norm is at least 1. By Remark 112.21 it follows that the norm of 

is exactly 1, i.e., that G^(n, k„) is critical. 

We now turn to the proofs of Theorems 13.121 and 112.11 starting with the 
subcritical case of Theorem 13. 121 which we restate below. 

Theorem 12.5. Let (Kn) be a graphical sequence of kernels on a (gener- 
alized) vertex space V with limit k. If k is subcritical, i.e., ||Tk|| < 1, and 
supj, y „ K„(a;, y) < oo, then Ci(G„) = O(logn) whp. 

Proof. As usual, we may assume that V is a vertex space; see Subsection l8.ll 
Consider first the case when Kn = n for all n and S is finite, or, equivalently, 
the regular finitary case. In this case, the result follows by comparing the 
neighbourhood exploration process of a vertex in the graph to a subcritical 
branching process: this comparison is similar to that made in the proof of 
Lemma [9.61 This time, setting uj{n) = ^logn, where A is a (large) constant 
to be chosen below, instead of the upper bound in (|9.5|) we claim that 

P(xGB)<p>^„)((l + 2e)K;i), (12.1) 

for all sufficiently large n. 

This can be proved by the comparison argument used for (|9.5|) . except 
that in the final step, instead of using the total variation distance between 
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the binomial and Poisson distributions, we note that if ^ denotes stochastic 
domination and p' = — log(l — p), so that p' = p + 0{p^) as p 0, then 
Bi(l,p) :< Fo{p'), and thus Bi(m,p) ^ Po(mp') for every m. Hence, for n 
large enough, Bi(n^-, ^(i, j)/n) ^ Bi{nj,K{i,j)/n) ^ Po[{l + 2e)K{i,j)nj). 
Note that in the argument leading to H9.5(l , and hence in our proof of ()12.1|1 , 
we do not assume that k is irreducible. 

For e small enough, the branching process X(i_|_2e)K is subcritical. There- 
fore, there is an a > such that 

p>fe((l + 2e)K;i) <e~"^ (12.2) 



holds for all i and all k > 1. Inequality (|12.2|1 is undoubtedly well known 
for finite- type Galton- Watson processes, but for the sake of completeness 
we sketch a proof. For z > let gz{x) := Ezl^^^^^l, where denotes 
the total population of the branching process. Using Ezi;^°(^) = e'^^'""^^ 
and independence of the Poisson numbers of particles of each type in the 
first generation, we have gz = ze'^''^^''^^\ If k is subcritical, then by the 
implicit function theorem this functional equation has a finite solution for z 
in a neighbourhood of 1, say for |z — 1| < S, and it follows by an argument 
similar to the proof of Lemma 15.61 that indeed Qzix) < oo for all x when 
z <l + 6. 

Taking A = 2/a for some a for which inequality 1)12.2(1 holds, and recalling 
(|12.1() . P(x £ B) < n~'^ follows, and so whp there are no vertices in B, i.e., 
there is no component of order ^logn or larger. 

For the general case, it suffices to bound k„ for large n from above by 
a subcritical finitary k'. Recalling that sup^y „^Kn{x,y) < oo, let k+ be 
defined as in Lemma 17.21 As \\TJ\ < 1, by Lemma 17.21 we have ||T^+ 11 < 1 
for m large enough, and we can take k' = k^. □ 

We now turn to the supercritical case, proving two results that together 



imply part (ii) of Theorem I.S. 121 

Theorem 12.6. Let (k^) be a graphical sequence of kernels on a (gener- 
alized) vertex space V with irreducible limit k. If k is supercritical, i.e., 
||Tk|| > 1, and\nix,y,ni^n{x^y) > 0, then C2{Gn) = O(logn) whp. 

Proof. As usual, we may assume that V is a vertex space. As the kernel 
K is supercritical, we have \\T^^\\ > 1, so there is an integer k such that 
(1 — l//c)||rK|| > 1. Fix such a k throughout the proof. 

Recall that Gn = {n, Kn) is constructed by choosing in an appropriate 
manner a (deterministic or random) sequence x„ = (xi, . . . ,Xn) giving the 
types of the vertices, and then constructing the edges using the kernel k„. 
Independently of Gn, let us partition V{Gn) = [n] into k subsets Si in a 
random way, by independently assigning each vertex to a random subset. In 
other words, we construct G„ (which has vertex set [n]) and then partition 
its vertex set into k classes Si. Let Sf := [n] \ Si. 

Let ^ be a very large constant, to be chosen later. We aim to prove that 
the event E that Gn contains a component with more than ^logn and at 
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most njA vertices has probability o(l). Every component C of Gn meets 
some Si in at least |C|/fc vertices. Let us say that a component C of Gn is 
had if it has at least Alogn and at most njA vertices, and meets Si in at 
least AXo^njk vertices. Then, as all Si are equivalent, it suffices to prove 
that whp Gn has no bad component. 

To this end, consider the subgraph G'^ of G„ induced by the vertices 
in S'l- Let /x' = (1 — l/k)fj,, and let be the (random) subsequence of x„ 
corresponding to those vertices i with i ^ Si. Note that V' = {S, fi' , (yn)n>i) 
is a generalized vertex space: condition (|2.2j) for V' follows from the same 
condition for V and the random choice of Si. Also, G'^ has exactly the 
distribution of G'^'(n, 

Since Ee(G^) = (1 — E e(Gn), and the sequence is graphical on 
V with limit k, it is graphical on V' with the same limit k. As k is irreducible 
on (S,n), it is irreducible on Let us write k' for the kernel k when 

viewed as a kernel on Thus T^' is an operator on L'^{S,ij.'), and, by 

assumption, ||Tk'|| = (1 — 1/A;)||Tk|| > 1. Let Ci be the largest component of 
G'j^ (chosen according to any rule if there is a tie). Then, by Theorem 18.11 
whp Ci contains at least p{K')n/2 vertices, and p{k') > 0. 

From now on, we shall assume that Ci has at least p{K')n/2 vertices, and 
choose A so that 1/A < p[n')/2. If a component C of G„ is bad, then it 
cannot contain Ci, and hence sends no edges to Ci. Let be the spanning 
subgraph of Gn obtained from Gn by deleting all edges between Ci and 
vertices in Si. If C is a bad component of Gn, then all edges of C are present 
in G^, so C is a component of G^. Hence, the probability that G„ has a 
bad component is bounded by o(l) (the probability that Ci is too small) 
plus the probability that some component C 7^ Ci of G^ containing at least 
A\ogn/k vertices of Si sends no edges to Ci in G„. Conditioning on x^. Si 
and G^, we have not tested any edges between Ci and Si, so these edges are 
present independently, each with its original probability. These individual 
probabilities are all at least (inf K)/n, where inf k := infj;^^^^ At„(x, y) > by 
assumption. 

Thus, for any of the at most n components C 7^ Ci of G^ with at least 
M = A\ogn/k vertices in Si, the probability that C sends no edges to Ci is 
at most 

(l-(infK)/n)^^l^il. 

As inf K > 0, |Ci| > p{K!)n/2 and M = A\ogn/k, we can make this proba- 
bility o{n~^^^) by choosing A large enough. □ 

Next, we turn to our general result Theorem 112.11 on the distribution 
of the small components of a supercritical graph G„ = G^ {n, Kn)- This 
will then be used to prove the final statement in Theorem I3.12| restated as 
Theorem 112.71 below. 

Proof of Theorem \12.1l The result is a simple consequence of Theorems 18.11 
I9.1fl[ 18.51 and 18.61 Indeed, given Gn = G^{n,Kn), let Ci be the largest 
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component of Gn (chosen according to any fixed rule if there is a tie), and, 
for each n, let y™ be the subsequence of consisting of those Xi for which 
i ^ Ci. Then, by Theorem 19.101 V = (5,/!, (y„)) is a generalized vertex 
space, where /I is defined by (i/i(x) = (1 — p{k; x)) dfj,{x); indeed, the only 
non-trivial condition to verify is ()2.4|1 . which is immediate from the same 
condition for V and (|9.2()|) . 

Next, we must show that the graphs G'^ and G^{n,Kn) may be coupled 
so that their edge-sets agree whp. This is easy to see: as usual, we condition 
throughout on x„, treating x„ as deterministic. If we condition also on the 
vertex set V of Ci, the only information this gives about edges of G„ inside 
= V{Gn)\V is that G'^ = Gn[V'^] contains no component larger than \ V\ 
(and that certain components of order exactly \V\ are ruled out). Without 

this condition, G^fX^'^] would have exactly the distribution of G^{n,Kn), 
so it suffices to prove that Ci(G^(n,K„)) < \V\ holds whp. In fact, as 
1^1 ^ p{n)n/2 whp, it suffices to prove that 



tends to as n ^ oo. 

Recall that the sequence x„ is deterministic. Pick a vertex j of G„ at 
random, and explore its component in G„ in the usual way, by finding the 
neighbours of j, then the neighbours of the neighbours, and so on. Let Vj 
be the vertex set of this component. The nature of the exploration process 
ensures that, given Vj, the edges of Gn\V^] are present independently with 
their unconditional probabilities. In other words, writing y,^ for the subse- 
quence of x„ consisting of those Xi with i ^ Vj, and setting V' = (5, /U, (y^)), 

the edge sets of G„[V^P] and G^' {n, «;„) have the same distribution. 

Let us condition on the event E that \Vj\ > p{K)n/2. As Theorem 13. ll can 
be applied to G„, and j was chosen at random, we see that F{E) is bounded 
away from 0. Also, appealing to Theorem 13.61 we see that whp G„ has a 
unique component of size at least p{K)n/2. Thus, given E, we have y^ = y^ 
whp. Hence, with probability ¥'{E){o{l) + r/), the graph G„ contains two 
components of order at least p{K)n/2. By Theorem 13.61 this probability is 
o(l), so we have r] = o(l), as required. Finally, as k is quasi-irreducible on 
(5,/n), it is quasi-irreducible on {S,p). 

It remains only to show that the sequence (k„) is graphical on V with 
limit K. Now (k„) is graphical on V with limit k, and all conditions of 
Definition 12 . 91 for V apart from the last, (|2.11j) . follow immediately from the 
corresponding conditions for V. In other words, we must show that 




Ee(G^(n,K„))/' 



n 




IL 



K{x,y)dp{x)dp{y) 



k{x, y){l - p{k; x))(l - p(k; y)) dp{x) dp{y). 
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As we may couple and G^{n,Kn) to agree whp, and as the number of 
edges in either graph is bounded by that in Gn, and thus, divided by n, 
is uniformly integrable, it suffices to prove the same limiting formula for 
Ee(G^)/n. This is immediate from the definition of G^, condition H2.11() 
for G„, and Theorem 13.51 □ 

Theorem 12.7. Let (k^) he a graphical sequence of kernels on a (gener- 
alized) vertex space V with irreducible limit k. If k is supercritical, i.e., 
||Tk|| > 1, and sup^. „ K„(x, y) < co, then C2{Gn) = O(logn) whp. 

Proof By Theorem ITTTl wh p C2(G 'n) = Ci(G^(n, and the result fol- 
lows from Theorems 112.31 and 112.51 □ 

Remark 12.8. One might think that C2(G„) = O(logn) would hold whp 
for any supercritical kernel k, i.e., that Theorems 112.61 and 112.71 would hold 
without the condition inf^^^^^^ k„ {x,y) > or supj. ^ „ Kn{x,y) < oo, at least 
for Kn = K, say. However, it is easy to construct counterexamples, similar to 
Example l8.61 by taking Xi = i/n £ S = {0,1] and modifying a suitable kernel 
to introduce a largish star with centre 1 not joined to the giant component. 

Such pathologies are not the only counterexamples: in the case where the 
Xi are uniformly distributed, for any function uj(n) = o{n), Example 14.121 
provides an example of a supercritical random graph with C2(G„) > uj{n) 
whp; thus the Op{n) bound in Theorem 13.61 is best possible. 

13. Vertex degrees 

In this section we turn to the vertex degrees, proving Theorem 13.131 if 
IS a graphical sequence of kernels on a vertex space V with limit k, and 
we define A(x) by 

:= j^K{x,y)dfi{y), 

then, writing Zf^ for the number of vertices of degree k in {n, Kn), our 
aim is to show that 

p _ i.^ ^ [ 



Z,/n ^ P(H = k)= nJr^ 

where H has the mixed Poisson distribution jgVo{\{x)) d^{x). 

In fact, Theorem l3.13l is stated for a generalized vertex space, and includes 
limiting results both for Z^/n and for Zk/\V{Gn)\. Since \V{Gn)\/n fJ-iS), 
these results are equivalent. As usual, the statement for generalized vertex 
spaces reduces to that for vertex spaces (see Subsection 18. If) , and what we 
must prove is exactly the statement above. 

Proof of Theorem VJ.l'JX Consider first the regular finitary case in Defini- 
tion ^31 Take a vertex v of type i, let Dy be its degree, and let Dy^j be 
the number of edges from v to vertices of type j, j = 1, . . . , r; thus = 
^ ■ Dy^j. Assume that n > maxK and condition on rii, . . . ,nr. Then the 
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D^j are independent for j = 1, . . . ,r, and Dyj ~ Bi(nj — 6ij, K{i,j)/n^ 
Po(//jK(i, j)) ; hence 



Sfo(J2 f^j^hj)) =Po(A(i)) 
j 

as A(i) = / j) dfi{j) = J2j Consequently, 



= k)^ P(Po(A(i)) = k) 



-e 



-A{i) 



A;! 

Let Zk^i be the number of vertices in G^{n, k) of type i with degree A;. Then, 
still conditioning on ni, . . . , n,,. 



- E Zk,i = -n, P(L>^ = k)^in P(Po(A(i)) = k) . 
n n ' 



It is easily checked that Var(Zfc j | rii, . . . , n^) = 0{n). Hence 

^Zk,i ^ P(Po(A(i)) = k)^ii, 
and thus, summing over z, 

-Zfc = -Zk,i ^ 5^P(Po(A(i)) = k)^x, = P(H = A;). 



n — ' n 

i I 

This proves the theorem in the regular finitary case. In general, define 
by H7.5|) . Let e > be given. From (|7.6() and monotone convergence, 
there is an m such that 

Kjyx, y) dfi{x) dfi{y) > jj^^ k{x, y) dfx{x) dn{y) - e. (13.1) 

For n > m we have «;~ < k„ by (|7.7|) . so we may assume that G{n, k~ ) C 
G{n,Kn)- (Here, as usual, we suppress the dependence on V.) Then, using 
Proposition 18.91 twice and ()13.1|1 . 

1 11 

-e(G(n, Kn) \ G{n, = -e(G(n, e(G(n, k~ )) 

n n n 



P 1 

2 



/52 

so whp e[G{n,Kn) \ G{n,K^)) < en. Let us write zj^'' for the number of 
vertices of degree k in G{n, k~ ). It follows that whp 

\zl^^ -Zk\< 2en. (13.2) 

Writing H^"*^ for the equivalent of H defined using k~ in place of k, by the 
first part of the proof, Z^"^^/n ^ P(H('") = /j). Thus whp 

|Z^"V^ - IPl^^""^ = A;)| < e. (13.3) 
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Finally, we have EH = X{x) dfj.{x) = JJg2 K{x,y) dfi{x) dfi{y). Since 
\i">-)(^x) < A(x), we can assume that H^™) < H, and thus 

p(s / = p(h-h(™) > 1) < e(h-h(™)) = If k- 

(13.4) 

Combining (ITT^ . (ITIOl) and (|n31), we see that |Zfc/n - P(H = /c)| < 4e 
whp. □ 

Let A be the random variable A(^), where ^ is a random point in S with 
distribution Then we can also describe the mixed Poisson distribution 
of H as Po(A). Under mild conditions, the tail probabilities P(H > t) and 
P(A > t) are similar for large t. We state this for the case of power-law 
tails; the result generalizes to regularly varying tails. As above, let D be the 
degree of a random vertex in G^(n, Let Z>k be the number of vertices 
with degree > k. 

Corollary 13.1. Let (k^) he a graphical sequence of kernels on a vertex 
space V with limit k. Suppose that P(A > t) = fi{x : A(x) > t} ~ at~" as 
t ^ oo for some a > and a > 1. Then 

Z>k/n ^ P(H > yfc) ~ aA;-", 

where the first limit is for k fixed and n — > oo, and the second for A; — > oo. 
In particular, lim„_^oo ^{D > /c) ~ ak~" as k ^ oo. 

Proof. It suffices to show that P(H > /c) ~ ak~°^] the remaining conclusions 
then follow from Theorem EUHl For any e > 0, P(Po(A) > t | A > (1 + 
e)t) 1 and P(Po(A) > t | A < (1 - e)t) = oit~") as t ^ oo, for example 
by standard Chernoff estimates |59| Remark 2.6]. It follows that P(H > t) = 
P(Po(A) > t) ~ at"" as t ^ oo. □ 

This result shows that our model does include natural cases with power- 
law degree distributions. For example, taking S = (0, 1] with the Lebesgue 
measure, and Kn{x,y) = K{x,y) = c/y/xy for c > constant, we have 
X{x) = 2c/V^, so P(A > t) = 4c^/t^ for t > 2c. Thus, by Corollary mm 
F{D > /c) ~ Ac^/k"^ as k ^ oo. In fact, in this case G^{n,K) is the 'mean- 
field' version of the Barabasi-Albert scale-free model; see Subsection 116.21 



For other power laws, see Subsection 116.41 

14. Distances between vertices 

One of the properties of inhomogeneous graphs that has received much at- 
tention is their 'diameter'. For example, considering the scale- free model of 
Barabasi and Albert [Hj, the diameter was determined heuristically and ex- 
perimentally to be 0(logre) in Plllf)|l8T]: for a precise version of this model, 
the LCD model, the value (1 + o(l)) log n/ loglogn was found rigorously 
in [221; later, this value was also found heuristically in [39] . 
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Often, the diameter is taken to mean the average distance between a 
random pair of vertices, or perhaps the 'typical' distance, although the usual 
graph theoretic definition (the maximum distance between a pair of vertices) 
is also used. Here we shall consider both interpretations. 

14.1. Typical distances. In this subsection we study the 'typical' distance 
between vertices; our aim is to prove Theorem 13. 141 giving upper and lower 
bounds on the distances between almost all pairs of vertices, showing that 
almost all pairs of vertices in the giant component are at distance roughly 
log n/ log ||T«;||. 

Many related results have been published, concerning random graphs with 
a fixed degree sequence, or random graphs with a given expected degree se- 
quence; we shall only describe a few here. These models are similar to 
(and in some cases special cases of) the rank 1 case of our model; see Sub- 
section 116.41 For example, Chung and Lu |3E1 EEl studied distances in a 
'random graph with given expected degrees'. For power-law degrees with 
exponent /3 > 3, where their model is a special case of ours, they obtained 
an asymptotic diameter of log n over the log of the average of the squares 
of the degrees, a special case of Theorem I3.14| see Subsection 116.41 for the 
connection to our model. 

Van der Hofstad, Hooghiemstra and Van Mieghem |^ (see also |54[ I55[ 
156) ) studied a model where the vertex degrees are i.i.d. with a certain dis- 
tribution, and the graph is chosen uniformly among all graphs with these 
degrees. They analyze the growth of vertex neighbourhoods in this model 
by using a branching process; this process is single type, but the number of 
children of a particle is not Poisson, so it is rather different from the one 
considered here. They obtain very precise results on the distances between 
a random pair of vertices, showing that it is log n/ log c + Op(l), where c 
is the expectation of d{v){d{v) — 1). (The —1 here comes from the degrees 
being (conditionally) fixed, rather than essentially Poisson.) 

There are many other papers in this area, both heuristic and mathemat- 
ical; we shall not attempt to list them. Let us mention only that Fernholz 
and Ramachandran [501, while mainly focussing on the diameter (see Sub- 
section 114. 2() also treat the typical distance between vertices. For further 
references we refer the reader to the discussion of related work in '53|. 

Let us now begin our preparation for the proof of Theorem 13.141 Let 
Kn be a graphical sequence of kernels on a vertex space V with limit k, 
and let G„ = {n, k^)- Note that we do not consider generalized vertex 
spaces; arguing as in Subsection 18.11 to prove Theorem 13.141 it suffices to 
consider vertex spaces. (For part (iv), we use also the fact that, by standard 
arguments, the conclusion holds if and only if (|3.1()j) holds for /(n) = rylogn, 
for every i] > 0.) We shall write dci^v, w) for the graph distance between two 
vertices v, w a graph G, taking dG{v,w) = oo if v and w are not in the 
same component of G. When the graph G is not specified, G„ = G^{n, Kn) 
is to be understood. 
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Lemma 14.1. If k is quasi-irreducible, then 

\{{v, w} : d{v, u;) < 00} I = + o,{n') = + o,{n'). (14.1) 

Proof. As noted in Subsection l3.51 by H3.9() this is immediate from Theorems 
EUandEll □ 



In particular, almost all pairs with either or both vertices outside the giant 
component are not connected at all, so we shall study the typical distance 
only in the supercritical case ||Tk|| > 1. 

Lemma 14.2. Let n be a regular finitary kernel on a vertex space V with 
\\Ti^\\ > 1. For any e > 0, 

E\{{v,w} :d{v,w) < (l-e)logn/log||T«,||}| =o{n^). 

Proof. Changing only the notation, we may assume that the type space S is 
finite, say 5 = {1, 2, ... , r}. It turns out that, as usual, we may assume that 
> for every i; however, here we cannot simply ignore Op{n) edges, 
so an argument is needed. Suppose that /^({i}) = for some i. Taking 
i^'ihj) = f^'ijyi) = niax/t for all j, and K'{j,k) = K{j,k) for j,k 7^ i, we 
have K < k', so we may couple G„ := G^{n, k) with = G^{n, k') so that 
Gn ^ G'^. Note that \\Ti^\\ = \\Ti^'\\, as k = k' a.e. 

For any r/ > 0, define by //'{i} = rj and = (1 — ??)a*({j}), J 7^ i- 

Thus /i' is obtained from ^ by shifting some measure from types other than 
i to type i. Changing the types of some vertices correspondingly, we obtain 
a vertex space V' = (5,//', (x^)„>i) such that whenever a vertex has type 
j in V, it has either type j or type i in V'. As K'{j,k) is maximal when 
one or both of j and k is equal to i, it follows that we can couple G'^ and 
G'^ = G^'{n,K' ) so that G'^ C G". As r/ ^ 0, the norm of T^' defined with 
respect to /i' tends to the norm defined with respect to /i. Since G„ C G'^, 
to prove Lemma 114.21 for G„, it thus suffices to prove the same result for 
G'^, defined on a vertex space with fJ-'{i} > 0. Iterating, it suffices to prove 
Lemma Il4. 21 in the case where /^({i}) > for every i. 

Let Tci{v) = Tii{v,Gn) denote the d-distance set of v in Gn, i.e., the set 
of vertices of Gn at graph distance exactly d from v, and let T^div) = 
r^div, Gn) denote the d-neighbourhood IJd'Kd^d'iv) of v. 

Let < e < 1/10 be arbitrary. The proof of (|12.1j) involved first showing 
that, for n large enough, the neighbourhood exploration process starting at 
a given vertex v of G„ with type i (chosen without inspecting Gn) could 
be coupled with the branching process X(i_,_2e)K(i) so that the branching 
process dominates. In particular, the two processes can be coupled so that 
for every d, \T(i{v)\ is at most the number N^i of particles in generation d 
of ^(i+2e)K(0- Elementary properties of the branching process imply that 
ENd = 0(||T(i+2e)«f ) = 0(((1 + 2e)A)'^), where A = ||r,|| > 1. 
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Set D = {1- lOe) log n/ log A. Then D < {1 - e) log n/ log((l + 2e)X) if e 
is small enough, which we shall assume. Thus, 

D 

E \T<d{v)\ <EY,Nd = 0(((1 + 2e)Xf) = 0{n^'') = o{n). 

d=0 

Summing over v, the expected number of pairs of vertices within distance 
D is o(n^), and the result follows. □ 

We now turn to the reverse bound, showing that most vertices in the giant 
component are within distance roughly log n/ log ||Tfj || . First we consider two 
random vertices. 

Lemma 14.3. Let k be a quasi-irreducible regular finitary kernel on a vertex 
space V with ||Tk|| > 1, and let v and w be two vertices of {n, k) chosen 
independently and uniformly. Then, for any e > 0, 

F{d{v,w) < (l+e)logn/log||T^||) p{Kf 

as n ^ oo. 

Proof. Note that an upper bound + o(l) follows from Lemma ll4.11 so 
it suffices to prove a corresponding lower bound. This time we may simply 
ignore types i with ^({i}) = 0, working entirely within the subgraph of 
Gn = G^(n,K) induced by vertices of the remaining types. As there Op{n) 
vertices of types i with = 0, changing e slightly it suffices to prove 

the result for G^. Thus we shall assume that > for every i. Also, 

restricting to a suitable subset of the types and renormalizing, we may and 
shall assume that n is irreducible. 

Fix < r/ < 1/10. We shall assume that rj is small enough that (1— 2?7)A > 
1, where A = \\T^\\. In the argument leading to (|9.5|) in proof of Lemma 19.61 
we showed that, given u){n) with u;(n) = o{n) and a vertex v of type i, 
the neighbourhood exploration process of v in Gn could be coupled with 
the branching process .^(i_2jj)k(0 so that whp the former dominates until it 
reaches size oj{n). More precisely, writing N^^k for the number of particles 
of type k in generation d of ^{i-2ri)K{})-, and T^^kiv) for the set of type-/c 
vertices at graph distance d from v, whp 

\Vd^k{v)\>Nd±,k = l,...,r, forallds.t. \T<d{v)\ < uj{n). (14.2) 

The key point is that this coupling works because we have only 'looked at' 
o(n) vertices at each step. 

Let us call a kernel k bipartite if 

S = L U R, with K{i,j) = whenever i,j £ L or i,j G R, (14.3) 

in which case the graph G„ is bipartite. For the moment, let us suppose 
that K is not bipartite. Let Nt{i) be the number of particles of type i in 
the tth generation of X^, and let Nt be the vector Nt{l), . . . , Nt{r). Also, 
let V = {ui, . . . , Ur) be the eigenvector of k with eigenvalue A (unique, up to 
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normalization, as k is irreducible). From standard branching process results, 
for example, [HI Theorems V.6.1 and V.6.2], we have 

Nt/A* ^ Xiy a.s., (14.4) 

where X > is a real- valued random variable, X is continuous except that 
it has some mass at 0, and X = if and only if the branching process 
eventually dies out. 

Let D be the integer part of log(n-'^/^"'"^'') / log((l— 2?7)A). From 1)14. 4() . whp 
either = 0, or ^ > ri^/^+^ for each k. Furthermore, as lim^^^oo '^{^^d ¥^ 
0) = p{{l - 2r])K) and ^ oo, we have F{Nd / 0) ^ p{{l - 2r])K). Thus, 
if n is large enough, 

P (yk : Ndm > n^/2+'') > p((l - 2r])K) - r/. 

By Theorem 16.41 the right-hand side tends to p(k) as — > 0. Hence, given 
any fixed 7 > 0, if we choose rj small enough we have 

P (yk : ND,k > ^1/2+'') > p{k) - 7 (14.5) 

for n large enough. It is easy to check that E{\T<d{v)\) = 0(^^/2) if rj IS 
small enough; for example, we may argue as in the proof of Lemma 114.21 
Hence, 

\^<Div)\ < r?''^ whp, (14.6) 
and whp the coupling described in ()14.2p extends at least to the D-neigh- 
bourhood. 

Now let V and w be two fixed vertices of G^(n, k), of types i and j 
respectively. We explore both their neighbourhoods at the same time, stop- 
ping either when we reach distance D in both neighbourhoods, or we find 
an edge from one to the other, in which case v and w are within graph dis- 
tance 2D + 1. We consider two independent branching processes 3£(i_2r;)K(^), 
^(i-2»?)k(-?)' '^ith Nd^k and N'^f^ vertices of type k in generation d respec- 
tively. By (|14.6)) . whp we encounter o(n) vertices in the explorations so, by 
the argument leading to (|14.2|) . whp either the explorations meet, or 

\^D,k(.^)\ ^ ^D,k and \rn,k(.w)\ > N'^^k^ k = l,...,r. 

Using (|14.5|) and the independence of the branching processes, it follows 
that 

F(^d{v,w) < 2D + lory k:\TD, kiv)\ AT D,kiw)\ > 71^/2+^) 

>(p(k)-7)'-o(1). (14.7) 

Conditional on the second event in (|14.7|) holding and not the first, we have 
not examined any edges from rD{v) to rD{w), so these edges are present 
independently with their original unconditioned probabilities. For any i', j', 
the expected number of these edges is at least \T ^^i/ {v)\\r dj' {v)\K{i' , j') /n. 
Choosing i' , j' such that K{i',j') > 0, this expectation is Q{{n^^'^'^^)'^ /n) = 
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Q{n'^^). It follows that at least one edge is present with probability 1 — 
exp(— r2(n^'')) = 1 — o(l). If such an edge is present, then d{v, w) < 2D + 1. 
Thus, (|14.7|) implies that 

F{div, w) <2D + l)> {p{k) - -if - o(l) > p{Kf - 27 - 0(1). 

Choosing r/ small enough, we have 2D + 1 < (1 + e) log n/ log A. As 7 is 
arbitrary, we have 

¥{d(v, w)<{l + e) log n/ log A) > pinf - o(l), 

and the lemma follows. 

The argument for the bipartite case is essentially the same, except that 
if V and w are of types in the same class of the bipartition, we should look 
for an edge between TdIv) and Td^iIw). □ 

Lemmas 114. II and 114. (-il have the following immediate consequence. 

Corollary 14.4. Let k he a quasi-irreducible regular finitary kernel on a 
vertex space V with \\Ti^\\ > 1. For any e > 0, 

\{{v,w} : d{v,w) < (1 + e)logn/log||T«||}| = p{Kfn^/2 + Op{n'^). (14.8) 

Proof. It follows from Lemma [l4.1l that the expected number of vertex pairs 
{v, w} with d{v, li;) < 00 is p{tif'n^ /2 + o(n^). 

Fix e > 0. From Lemma ll4.3l the expected number of pairs of vertices at 
distance less than d = {1 + e) log n/ log \\T,J\ is p{Kfn^/2 + o(n^). Hence, 
the expected number of pairs with d < d{v,w) < cxd is o(n^), so there are 
Op(n^) such pairs. Using ()14.1|) again, ()14.8|) follows. □ 

After this preparation it is easy to deduce Theorem 13. 141 As noted earlier, 
it suffices to consider vertex spaces, rather than generalized vertex spaces. 

Proof of Theorem \3.14\ Let graphical sequence of kernels on a vertex 

space V with limit k, let G„ = G^{n, Kn), and let e > be fixed. We must 
prove four statements, which we recall separately below. 



(i) The first part of Theorem 13. 141 is exactly Lemma ll4.1l (but with quasi- 
irreducible replaced by irreducible), which we have already proved. 

We must show that if sup^ ,^ „^ y) < 00, then only Op{n'^) vertices 



of Gn are within distance (1 — e) log n/ log ||Tk||. As usual, we approximate 
with the regular finitary case. Let be a sequence of regular finitary ker- 



nels on V with the properties guaranteed by Lemma 17. 21 By Lemma [7.^riii)[ 

11^..+ II > 1, so there is an m such that (1 — e/2)/log ||T + || > 

(1 — e)/log llTftll. Fixing such an m, we may couple Gn and G^{n,K^) 
so that Gn ^ G^(n, K^) for n > m, and the result follows by applying 
Lemma ll4.2l to G^(n, k^) with e/2 in place of e. 



(iii) This time we must show that if k is irreducible and \\Ti^\\ < 00, 



then p{k) n /2 + Op{n ) pairs of vertices of Gn are within distance (1 -|- 
e) log n/ log ||Tk||. By (|14.1|) . it suffices to prove the lower bound. Again, 
we approximate with the regular finitary case, this time working with a 
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graph G'^ = G^{n,k^^) C Gn- The argument is as above, but using the 
approximating kernels k~ given by Lemma f7.3l instead of k^, and apply- 
ing Corollary 114.41 with e/2 in place of e: by Lemma 17. ."^fi)! the quasi- 
irreducibility condition of Corollary 114.41 is satisfied, while Lemma I7..'^rii)| 
implies \\ / ||r«;||. 



(iv) This time we must show essentially that if k is irreducible and ||Tk| 



oo then almost all pairs of vertices in the giant component are within distance 



o(logn). First, let r/ > 0. By the same proof as for part (iii) above, except 
that we have IIT-- 11 oo, we see that p(K)^r? 12 + Or^iri^) pairs of vertices 
of Gn are within distance 77 log n. Since r\ is arbitrary, a standard argument 
shows that we can replace 77 log n by some function f(n) = o(logn). □ 



Remark 14.5. Part |(ii)| of Theorem 13.141 does not hold if we omit the 
condition that sup^ ^ ,j y) < 00, even if = k for all n, with ||Tk|| < 
00. To see this, let S = [0, 1] with // the Lebesgue measure, and let (x„)„>i 
be disjoint deterministic sequences such that V = (5,/^, (x„)„>i) is a vertex 

space. We shall write x„ as (a;^"'^ . . . , xi"^) to emphasize the dependence 
of the terms on n; for example, we may take xf' = {i- V2/2)/n. Taking 
K{x,y) = 2 for all x, y, the graph G^(n, k) is a supercritical Erdos-Renyi 
random graph. 

Forming k' by modifying k on a set of measure zero, we can effectively 
add or delete o{n) given edges to/from k) whilst keeping k' graph- 

ical on V with e. In particular, given f{n) = o(n), taking 

k'{x^\x^^^) = n?, say, for 1 <i < f{n), we may ensure that in the graph 
G'n = G^ {n, k'), whp the vertex 1 is joined to all of the vertices 2, 3, ... , f{n). 
By Theorem 13.11 the giant component still has p{2)n + Op{n) vertices. 

For any uj[n) ^ 00 it is easy to check that if we choose /(n) large enough, 
all but Op{n) vertices in the giant component are within distance u}{n) of one 
of the vertices 1,2,..., /(n), and thus, all but Op{in?) pairs of vertices in the 
giant component are within distance 2u;(n) -|- 2. Hence, even if \\Tf^\\ is 
bounded, the typical distance between vertices may be smaller than any 
given function tending to infinity. 

Even if we allow \\T^\\ = 00, the typical distance cannot be as small as a 
constant: one can check that when k is irreducible, for any C there are whp 
0(n^) pairs of vertices in the giant component at distance at least C. In 
fact, there are G(n) vertices in the giant component whose C-neighbourhood 
is a path. This can be proved using a combination of the arguments leading 
to Theorems I3.1l and l9.11 

Remark 14.6. Using the same vertex space as in Remark 114.51 for any 

/(n) = o(n), we can define a graphical sequence of kernels k" with (irre- 
ducible) limit K = 2 such that whp G^{n,K'^) is obtained from the Erdos- 
Renyi graph G{n, 2/n) by deleting f{n) + 1 vertices and replacing them with 
a path of length f{n) not joined to the rest of the graph. In this graph there 
are at least (/(n)/3)^ pairs of vertices at distance at least /(n)/3. Hence the 
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average distance between vertices (counting only pairs at finite distance) is 
at least r2(/(n)^)/n^, which may be much larger than O(logn): in fact, it 
may be larger than any given function that is o(n). This is the reason for 
considering the distances between almost all pairs rather than the average 
distance in Theorem l^.l^fii 



Remark 14.7. In certain cases, we know better bounds than o(log n) on the 
typical distances between vertices. For example, the (l+o(l)) log n/ log log n 
formula for the diameter of the m = 2 LCD model proved by Bollobas 
and Riordan |23] certainly holds as a bound on the typical distances in the 
much simpler 'mean-field' case described in Subsection ll6.2( where k{x, y) = 
l/-yjxy and = i/n. 

However, without further restrictions (which could be on At, or on the 
distributions of the Xj), we cannot strengthen the o(logn) bound in part 
|(iv)| of Theorem 13.141 Indeed, given a graphical kernel k on a vertex space 
V = (5, /i, (x„)„>i) with IIT^II = cxD, and any function g{n) — > oo, set- 
ting Kn = K A g{n) we have E|r(;(r;)| < g{n)'^ for a given vertex v and 
every d > 0. Indeed, we have G{n,Hn) ^ G{n, g{n)/n). The argument 
in the proof of Lemma 114.21 shows that the typical distance is at least 



(1 + o(l)) log n/ log g{n). Hence the o(logn) bound in part (iv) is best possi- 
ble. A similar example may be constructed with a fixed kernel k by modify- 
ing the sequences (x„)„>i appropriately; take for example K{x,y) = 1/y^xy 
and Xi = max|z/n, l/5'(n)}. 

14.2. The diameter. Let k be a kernel on a (generalized) vertex space V 
in which the set of types is finite. In this subsection we study the diam- 
eter of Gn = G^{n,K), measured in the usual graph theoretical sense for 
disconnected graphs: 

diam(Gn) := ina.x{d(v,w) : v,w £ V{G), d{v,w) < oo}, 

where d{v, is the graph distance between v and w in Gn- 

The following is a partial list of existing work on the diameter of sparse 
random graphs: Bollobas and Fernandez de la Vega jTH] found the asymp- 
totic diameter of random r-regular graphs, Luczak obtained detailed 
results for G{n,c/n) with c < 1, Chung and Lu studied G{n,c/n), 
c > 1, and Fernholz and Ramachandran obtained a precise result for 
random graphs with i.i.d. degrees (see below). 

When Gn has finite-type, provided k is not critical we can easily find the 
diameter of G„ in the form (c + o(l)) logn. The constant c = c{k) will be 
obtained from the branching process in a simple way, different in the 
sub- and super-critical cases; see Theorems 114.81 and 114.1 ll below. Together, 
these results, which we shall prove separately, constitute Theorem 13.161 

The results in this subsection correspond to those of Fernholz and Ra- 
machandran |5f)j for a different model, where the distribution of the vertex 
degrees is fixed, the vertex degrees are sampled independently from this dis- 
tribution and, conditional on the degree sum being even, the graph is then 
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chosen uniformly at random from all graphs with the given degree sequence. 
The special case of our model where k has rank one is a special case of this 
model; see Subsection 116.41 In general, the two models are different. The 
proofs in [201 are much more complicated than those we shall present here, 
because their model does not have independence built in. Thus, roughly 
speaking, Fernholz and Ramachandran have to work to get the branching 
process approximation that we have here as our starting point. Also, here 
we keep things simple by considering only the finite type noted in 

Subsection 18.51 even the single type case is non-trivial. 

Throughout this subsection, when exploring the neighbourhoods of a ver- 
tex V in Gn = {n, k), we fix in advance an arbitrary order on the vertices 
of Gn- At each step in the exploration, among unexplored vertices at mini- 
mal distance from v, we choose the first vertex w in this order, and reveal all 
edges from w to vertices not yet reached by the exploration. In this way we 
reveal the vertex sets of the neighbourhoods Tt{v), t = 1,2, . . . successively. 
Furthermore, the graph we reveal is always a tree, rooted at v. We shall 
denote this graph by T{v), and call it the reduced component of v. For t > 
we write Tt{v) for T{v) H T<t{v), the reduced t -neighbourhood of v. 

Specifying which unexplored vertex to choose next does not affect the 
coupling arguments leading to 1)9. 5|) . for example, where any unexplored 
vertex could be chosen at each step. The advantage is that the tree T{v) 
is uniquely specified even if the component containing v has cycles; below 
we shall sum the probability that the reduced component of a vertex is a 
particular tree over all trees. Using the reduced component guarantees that 
the corresponding events are disjoint. Note that if v and w lie in the same 
component, then d{v, w) is the same as the graph distance in T{v) between 
the root, v, and w. 

We shall first prove the subcritical case of Theorem 18. restated below. 

Theorem 14.8. Let k be a kernel on a (generalized) vertex space V = 
{S, n,{xn)), with S = {1,2, ...,r} finite and n{{i}) > for each i. If 
< IIT^II < I, then 

diam(G„) p 1 
logn logllTKll"^ 
as n ^ oo, where Gn = G^ {n, k). 

Proof. We may assume without loss of generality that k is irreducible. Also, 
by conditioning on the sequences (x„), we may assume that V is a vertex 
space, and that the number of vertices of type i is deterministic (see 
Subsection 18. Ij) . 

Let pd,i be the probability that the branching process X^ii) survives for 
at least d generations. As the number of particles in the first generation that 
have descendants in generation d + 1 has a Poisson distribution, we have 

Pd+i,i = 1 - exp(^- ^ K{i,j)fi{{j})pdjy 
j 
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Recalling that pd^i \ p{K,i) = as d ^ oo, and using 1 — exp(— x) = 
X + 0{x^), it follows easily that, for each i, 

Pd,i = i\\Tj+oil)f as d^oo. (14.9) 

Let Lo = u){n) = j41ogn, where A is a constant, chosen large enough that the 
estimates below hold. As jC^ is subcritical, p>k{K) decays exponentially with 
k; see (fTT^ . Thus /0>^(k) =o{n-'^), say. Let d= (1 ± e) logn/ log ||r«;||-^ 
where e is a small positive constant; we shall consider both choices of sign 
below. 

Let T be a rooted tree where each vertex has a type from S = {1, 2, . . . , r}. 
We shall say that a tree T is relevant if it has height at least d and contains 
at most uj vertices. Let tt{T) be the probability that jC^ is isomorphic to T, 
in the natural sense. 

The sum of ir{T) over relevant T is the probability that survives at 
least d generations and contains at most oj particles in total, which is (||Tk|| + 
o(l))'^ + o(n-2) = n-iT^+°W. 

Let p{T) be the probability that the reduced component of a random 
vertex v of G„ is isomorphic to T in the natural sense. From the step-by- 
step exploration, one can check that 

p{T) = (1 + o(l))l^l7r(r) = n°(i)7r(r) 

for any relevant T: the proof is similar to that of Lemma 111. 41 but one shows 
that at each step the conditional probability of finding the right number 
a of new neighbours of a particular type in the graph is within a factor 
(1 -|- o(l))'^^^ of the corresponding Poisson probability, as long as both a 
and the number of previously uncovered vertices are o(n). Let a be the sum 
of p(T) over relevant T. Then it follows that a = n^^^'^^°^^\ In particular, 
na = o(l) if we take the plus sign in d = (1 ± e) log n/ log ||Tk||~"^, and 
na — > oo if we take the minus sign. 

Using (|12.2|) again, the expected number of vertices in Gn with more 
than uj{n) vertices in their (reduced or unreduced) component is o(l) (in 
fact, o{n~^^^)), so whp there no such vertices. Taking the plus sign in 
d, the expected number of vertices in G„ whose reduced component is a 
relevant tree is na = o(l). Together, these bounds show that whp every 
vertex v is such that all w with d{v,w) < oo have d{v,'w) < d. Thus, whp 
diam(G„) < d = {1 -|- e) logn/ log ||Tk||^"'^. As e > was arbitrary, this 
proves the upper bound in Theorem 114.81 

For the lower bound we take the minus sign in d, so na = n^"^°^^) oo, 
and use the second moment method. The key point is that if T, T' are rele- 
vant trees, and p(T, T') is the probability that independently chosen random 
vertices w have vertex- disjoint reduced neighbourhoods isomorphic to T, 
T' respectively, then 



p{T, T') = (1 + 0{u:/n)rp{T)p{T') ~ p{T)p{T'). (14.10) 
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This again follows from the step-by-step exploration, as finding one tree uses 
up at most uj vertices. (Note that we do not have p{T) ~ vr(T), because all 
we know about rij is that ni ~ n//({i}).) 

Let be the indicator function of the event that the reduced neigh- 
bourhood of t; is a relevant tree, and let A'' = X]"=i s° ^{N) = na. 
Expanding E(A^^) = Yl,v^w^^'^'"'^'t")i contribution from pairs v, w 
in the same component is at most ^^E(Xt,u;) = naijo: if the component 
containing v is relevant, then by definition it contains at most lo vertices w. 
Using H14.1Up above, it follows that 

E(iV2) = (1 + o(l))nV2 + O{n(TL0). 

Now E(A^) =na = which is much larger than uj, so E(iV2) ~ E(A^)2, 

and whp > 0. So whp there is a vertex v whose neighbourhood is a 
relevant tree, and thus includes a vertex w at distance at least d = (1 — 
e) log n/ log IIT^II""'^, completing the proof of Theorem 114.81 □ 

Remark 14.9. Recall that p{T) was defined as the probability that the 
reduced neighbourhoods of a random vertex v are isomorphic to T, allowing 
the possibility that there are some edges within each Td{v). The reason 
was that, in proving the upper bound on the diameter, we must rule out 
components of large diameter that contain cycles, as well as components 
that are trees. If we redefine p{T) to exclude edges within each T(i{v), 
then p{T) changes by a factor (1 + 0(|T|/n))l"^L As we only ever consider 
T with |T| = O(logn), this factor is 1 + o(l), and all our estimates go 
through. In particular, whp Gn contains a tree component of diameter at 
least (1 — e) logn/ log ||Tk||~-^. 

Remark 14.10. One might expect Theorem II 4 . 81 to generalize immediately 
from S finite to (at least) the case n bounded. However, diam(G) does not 
always decrease when an edge is added to G, as the new edge might join two 
components. Thus one cannot just sandwich G^(n, k) between finite-type 
graphs and apply Theorem 114.81 In the unbounded case, the construction 
described in Remark 114.61 shows that for any a;(n) = o(n) one can construct 
a graph G^{n,K) with k supercritical, such that diam(G^(n, k)) > u>{n) 
whp. Modifying the construction by starting with a subcritical Erdos-Renyi 
graph gives an example with k subcritical. 

We now turn to the supercritical case of Theorem \'A.W\ restated as The- 
orem ^^^2 below. Recall that, given a supercritical kernel k on a ground 
space (5,/i), there is a 'dual' kernel k on a ground space (5,/i), defined 
as follows: as a function on S x S, k = (1 — p{k))k, while dfi{x) = 
{1 — p{k]x))/{l — p{n))d^{x). In particular, in the finite-type case, p{{i}) = 
(1 — p{k;i))/{l — p{k,)) p{{i}) . Note that we have chosen to renormalize the 
dual kernel defined in Definition 13.151 so that p is a probability measure. 
As discussed after Definition l3.15( this makes essentially no difference; how- 
ever, it allows us to speak of the branching process started with a particle 
whose type is chosen according to fi. (In the remark after Definition 13.151 
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we wrote k' and fi' for the renormalized dual kernel and associated measure; 
here we write k and jl for notational convenience.) When we write \\Tf^\\, we 
mean the norm of defined with respect to {S,fi). 

Theorem 14.11. Let k. be an irreducible kernel on a (generalized) vertex 
space V = (x„)), with S = {1,2, ... ,r} finite and /^({i}) > for 

each i. If ||Tk|| > 1, then 

diam(G„) p 2^1 

logn iog||r«,||~i iog||r«;||' 

where Gn = {n, k). 

The relevance of the dual kernel is that it describes components other 
than the giant component. In particular, it follows from Theorem 1 1 2 . II and 
Theorem 114.81 that the diameter of the largest 'small' component of G„ will 
be (1 + o(l)) log n/ log ||T;5;||~^. As we shall see, the same quantity will give 
the height of the tallest tree attached to the two-core. The diameter will 
be given by two such trees attached to vertices of the two-core at typical 
distance, (1 + o{l)) log n/ log ||Tk||. 

The idea of the proof is as follows: instead of considering the event that 
the neighbourhoods of a vertex v form a tree of height at least d, we consider 
the event that the neighbourhoods are thin for d generations, meaning that 
each generation has size at most to, with u = ^logn as before. For the 
upper bound, we will show that whp no vertex has neighbourhoods that are 
thin for more than d = (1 + e) logn/ log generations, where a = \\Tf^\. 
For the lower bound, we will find two trees of height roughly d attached to 
typical vertices of the two-core. 

The reason for considering thin neighbourhoods is that, once Tt{v) is 
larger than ^logn for some t, the neighbourhoods rs(f), s > t, grow rea- 
sonably rapidly. 

From now on, we assume that k is an irreducible kernel on a finite ground 
space with > for each i, and that the number nj of vertices 

of each type i is deterministic, with Ui/n —> //({i}). As before, it suffices to 
prove Theorem 114.111 under these assumptions. Let 

t{v) := min{r : |rr(u)| > uj} 

denote the index of the first thick neighbourhood of a vertex v, when there 
is one. 

Lemma 14.12. For any e > 0, whp the graph Gn does not contain two 
vertices V, w with the properties thatt{v), t{w) are defined, t{v), t{w) < n^/^, 
and d{v, w) > t{v) + t{w) + {1 + e) logn/ log \\T,^\\. 

Proof. We show that the expected number of pairs is o(l), by showing that 
the probability that a random pair v, w has the properties is o(n~^). Explore 
the neighbourhoods of v and w simultaneously, stopping at the first thick 
neighbourhood of each, if there is one. Suppose, as we may, that t{v) and 
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t{w) are defined and at most n^/^. Then with very high probabihty we 
have seen o(n) vertices (the neighbourhoods can't have grown too much in 
the last step). If the neighbourhoods have already joined, we are happy. 
Otherwise, continue exploring. Simple Chernoff bounds show that for any 
?7 > 0, if ^ is chosen large enough, with probability 1 — o{n~^^^) the number 
of vertices of each type found at each subsequent step is within a factor l±r] 
of its expectation. It follows that the neighbourhoods grow by a factor of 
(l±2r7)||TK|| at each step, after a few steps to allow the distribution of types 
to converge to the relevant eigenvector of k. Once both neighbourhoods 
reach size n^^'^'^^, they join at the next step with very high probability. □ 

For the rest of this section, let a = \\Tf^\\ denote the norm of the dual 
kernel k. As S is finite, a < 1 by Theorem ll2.3l Recall, from the discussion 
before Lemma 16.61 that has the same distribution as conditioned on 
extinction. Let td,n be the probability that stays alive but thin for d 
generations: 

td,n-=^{l<\Xt\ <io : l<t<d), 

where Xt is generation t of X^. Note that depends on n, via the definition 
of UJ. 

Lemma 14.13. For any rj>0,ifnis large enough, then 

{a - < td,n < (a + (14.11) 

holds for all d in the range ^ log n/ log a""^ < d < 2 log n/ log q^"*^. 

Proof. We start with the lower bound. 

Let j be the probability that the branching process Xf^{i), started with 
a particle of type i, survives for at least r generations. By ()14.9|) . we have 
p'^j^ = {a + o{l)Y as r ^ oo. Let p'^ji be the probability that generation r 
of Xfi{j) consists of a single particle of type i. As the branching process is 
subcritical, one can check that p'^a = &{p'ri) for r even. (The restriction 
r even is only needed if the kernel k is bipartite, in the sense of ()14.3|) .') 
We shall need only the much weaker statement that p"j j = (a + o(l))''" as 
r — > oo with r even; this can be proved along the same lines as (|14.9|1 : let 
A^i be the number of particles x in the first generation of Xfi{j) with the 
property that the descendants of x in generation d + 1 consist of a single 
particle of type i. Let N2 be the number of particles x in the first generation 
of ^^(j) with more than one descendant in generation d + 1 of type i, or any 
descendants of types other than i. Note that A^i and N2 have independent 
Poisson distributions, with 

EiN,) = Y,f^U,k)mpl,^„ 

k 

and 

k 
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Since pI^ - < p'a,^ < p>d{k;k) ^ 0, we have E(iVi), E{N2) ^ 0. By 
definition, 

Pd+ij,. = = 1,N2 = 0)= E(Ari) exp(- E(iVi) - E(iV2)) ~ n^i). 
In other words, 

Pd+i,j,i = (1 + o(l)) k)il{k}p'li. i. 

k 

Recahing that a is the norm of defined with respect to fi, it follows that 
P'ri i = ('^ + o{l)Y as r ^ oo with r even. 

The probability that Xfi has any thick generation at all is at most the 
expected total size of divided by uj, which is 0(1/cj) = 0(1/ log n). Tak- 
ing r — > oo slowly enough, for example r = 2 [log log log n] , this probability 
is much smaller than p"^^. Hence, with probability p'" = (a + o{l)Y the 
branching process remains thin for r generations, and the rth genera- 

tion is a single particle of type i. Restarting, we see that for any d we have 
td,n > fi{{i}){Pr'i)^'^^''\ and the lower bound in (I14.11|l follows. 

Let c > be a (small) constant. Simple Chernoff bounds show that, 
given that a certain generation t of has size at least clogn (and given 
the numbers of particles of each type), generation t + ci has size at least u! 
with probability at least 1 — n~'^^, for some constants ci, C2 depending on 
c and K. Hence, if L = L{n, c) is chosen large enough, the probability that 
stays thin for d generations and has size at least c log n for the last L of 
these is at most n~^^^: given that generation d — L has size at least clogn, 
the probability that generation d — L + ci is still thin is at most n~^'^ . If this 
generation is thin but also has size larger than clogn, generation d — L + 2ci 
is unlikely to be thin, and so on. 

Hence, td,n is within n~^^^ of the probability that stays thin for d 
generations and one of the last L of the first d generations has size at most 
clogn. Let Qr = qr,c be the probability of the event that survives for at 
least r generations, that the first r generations are thin, and that generation 
r has size at most clogn. We have shown that 

td,n< Yl '?r + n-io°. (14.12) 

d-L<r<d 

As a generation of size clogn has probability n~^^'^^ of dying out immedi- 
ately, the probability that X^, survives for exactly r generations is at least 
qrn~'^^^\ Hence, j, the probability that Xf^, which is just conditioned 

on dying out, survives for r generations is also at least qrn~^^^\ Using 
p'ri — (0^ + 0(1))'", it follows that Qr < n'^^^^ (« -|- o(l) )^ . In particular, choos- 
ing c small enough (depending on k and r/), for r > ^ log n/ log a^^, say, we 
have Qr < ia + r//2)''. Using (|14.12|) . it follows that 



td,n < L{a + r?/2)'^-^ + n-i°° < (a + ry)'^ 
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for n large enough and d in the range considered in (|14.1H) . completing the 
proof of Lemma 114.131 □ 

Let T be a rooted tree in which each vertex has a type from S. Let h{T) 
be the height of T, i.e., the maximal distance of a vertex form the root. 
We shall say that is consistent with T if the first h(T) generations of 
Xk are isomorphic to T, and write t^'{T) for the probability of this event. 
Analogously, we say that the reduced neighbourhoods of a vertex v of Gn 
are consistent with T if T^rp){v), the tree formed by the first h{T) such 
neighbourhoods, is isomorphic to T. We write p'(T) for the probability that 
the reduced neighbourhoods of a random vertex of G„ are consistent with 
T. The definitions are almost the same as those of vr(T), p{T), except that 
we do not care what happens after the first h{T) generations. (We could 
have used these definitions in the subcritical case - there it was not essential 
that we explored the whole component.) 

We are now ready to prove Theorem 114.111 

Proof of Theorem \14-ll\ As before, we may assume that V is a vertex space, 
and that the number Ui of vertices of type i is deterministic. 

Let e > be fixed, and let d = [1 ± e) logn/loga"-^, where a = ||T/>||, 
as above. As before, let to = Alogn, where ^ is a constant chosen large 
enough for our bounds to hold. Having chosen A, let A' be another constant 
chosen large enough that the bounds below hold. 

Let be the probability that the branching process is alive and 
thin for h generations (as in the definition of td,n), but that these first h 
generations contain more than A' logn particles. The proof of Lemma [l4.13l 
shows that if A' is chosen large enough, then < n~^^ for any h. Indeed, 
the argument leading to 1)14. 12|) gives a corresponding bound on t^^ with 
Qr replaced by the probability of an appropriate event, defined as qr, 
but with the extra condition that there are at least A'logn/2 particles in 
the first r generations. (We take A' large enough that Luj < A'logn/2.) 
Then, as before, can be bounded by n'^^^^ times the probability that 
survives for exactly r generations and contains at least A'logn/2 particles. 
Using only the exponential decay of p>k{k), this is at most n~^^^ if A' is 
large enough, for any r. 

Let us say that a tree T is relevant if T has height d, is thin, and contains 
at most yl'logn vertices. As T has O(logn) vertices, we have p'{T) = 
(T), as before. The sum of vr'(T) over all relevant T is exactly 

by Lemma 114.131 and our bound on above. As before, the sign above is 
the opposite of the sign we choose in d = (1 ± e) log n/ log a~^. It follows 
that the sum a of p'{T) over relevant trees is n~^^'^~^°^^\ 

To prove theupperboundin Theorem 114. IH let d = ( 1 + e ) log n/loga^^. 
Then we have na = o(l), so whp no vertex of Gn has neighbourhoods 
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consistent with a relevant tree. It is easy to check that whp no vertex has 
neighbourhoods consistent with a thin tree T' of height d that is not relevant 
(because it contains more than A'logn vertices): any such T' contains a 
subtree T" given by the first h generations of T' for some such that 
T" is thin and has between ^'logn and A' \ogn + u) = O(logn) vertices. 
But then p'{T") = n°^^K'{T"), and Et"^'(^") is at most J2h<n4,n ^ 
n~^^. It follows that whp no vertex of v has neighbourhoods consistent 
with any thin tree of height d. Thus, for every v, either Td{v) is empty, or 
t{v) < d. Applying Lemma ri4.12| the upper bound on diam(Gn) claimed in 
Theorem 114.111 follows. 



For the lower bound, we aim to find many tall thin trees attached to the 
two-core. As in Remark ll4.9l for this part of the proof we modify our notion 
of the consistency of the neighbourhoods of a vertex v of G„ with a tree T 
of height d, by disallowing edges within each Tt{v), t < d. We redefine p'(T) 
correspondingly; as in Remark Il4. 91 this changes p'{T) by a factor 1 + o(l) 
for trees of the size we consider, so our estimate p'(T) = n°^^^Tr'{T) goes 
through. 

A good tree T will be a relevant tree with height d = (1 — e) log n/ log a^^ 
in which generation d consists of a single vertex a. Changing e slightly, 
we shall assume that d is a multiple of the quantity r considered in the 
proof of Lemma 114.131 Then this proof shows that with probability at 
least (a + o(l))'^ = 7^-1+^+0(1) ^j-^g j^^^^ ^ generations of are thin and 
generation d consists of a single particle. With our bound on tj"^, it follows 
that ^j'T^'iT) = 7j^^+^+°(^), where the sum is over good trees. As p'{T) = 
77,0(1)71-' (T) for each good T, we have ^j^p'iT) = 77-i+'^+o(i)^ As in the 
subcritical case, the second moment method gives us many good trees in 
the graph, but this is not enough - they might not be attached to the two- 
core. 

For V G y{G) and T a good tree, let E2{v,T) be the event that the 
following all hold, where di = Clog log n, and C, Ci are constants to be 
chosen below: the first d neighbourhoods of v form the tree T, the single 
a e ^d{v) has two neighbours 61, 62 in ^d+i{v) each of which has at least 
uj 'descendants' in Td+diiv), and |r<rf_|_dj < (logn)*^^. We claim that 
if the constants C and Ci are chosen large enough, and is a random 
vertex of then ¥{E2{v,T)) = Q.{p'{T)). To see this, note that p'{T) 
is exactly the probability that the first condition is satisfied. Conditional 
on this happening, bounding the neighbourhood exploration below by a 
supercritical branching process shows that the existence of 61, 62 with the 
required properties has probability bounded away from zero. Finally, given 
that |r(i(f)| = 1, the expected size of the next di generations is at most 
(2supfi;)'^i = {\ogn)'-'^^\ so the claim follows. 
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Let E2{v) be the event that E2{v^ T) holds for some good T. As YItP'O^) ~ 
n~'^'^^^°^^\ we have 

¥{E2{v)) = ^¥{E2{v,T)) = n-i+^+"(i), (14.13) 

T 

where the sum is over good T. 

We would hke to show that whp there are two (in fact, many) vertices v, 
w for which E2{v), E2{w) hold. We could use the second moment method, 
but as we shah need the relevant neighbourhoods of v and w to be disjoint, 
it turns out to be easier to test vertices one by one. 

Whether E2{v,T) holds can be determined by exploring the neighbour- 
hoods of V, stopping when at most M = (logn)'-^^ vertices have been uncov- 
ered. Let us construct a sequence of t tests, t = n/M^, as follows. Each test 
starts from a vertex Vi, where the Vi are chosen independently and uniformly 
at random from V{Gn)- In the ith test, we explore the neighbourhoods of Vi, 
uncovering at most M vertices, and attempting to verify that E2{vi) holds. 
We abort the attempt if we reach a vertex uncovered in a previous attempt. 
As at most tM = njM'^ vertices have previously been uncovered, for each 
vertex we reach, the probability that it was previously uncovered is at most 
0(M~^). As the ith test involves examining at most M vertices, conditional 
on everything so far, the ith test succeeds with probability 

(1 - 0{M-'^)f'^ ^{E2{v)) ~ P(S2(^^)) = 

The number of tests that succeed dominates a binomial random variable 
with mean ^7^-i+£+o(i) — ^£+o(i) _j. ^f^Y^-, at least two tests succeed. 

Hence, whp there are vertices u, w in Gn for which E2{v), E2{w) hold, with 
the relevant neighbourhoods disjoint. 

Now whp Gn has the property that whenever E2{v) holds, the correspond- 
ing a G ^di'^) is in the two-core of G„. The argument is as for Lemma [l4.12l 
we may continue expanding the large neighbourhoods of 6i, 62 until they 
meet. Hence, whp G„ contains two vertices vi, V2 belonging to separate 
trees T\, T2 of height d, each attached to the two-core by the single ver- 
tex at distance d from the root. We shall need only this last fact, basic 
properties of the model, and Theorem 13.141 

Recall that G„ is a graph on labelled vertices {1, 2, . . . , n}, each of which 
has a type in S. Given G„ and the vertex types, let us separate G„ into 
the two-core G^, a list of trees Tj each attached to the two-core at some 
attachment vertex aj, and the rest of Gn- So far, we remember the label 
of each vertex. Now, however, let us forget the labels of the attachment 
vertices of Tj, while remembering their types. To reconstruct Gn, we 
should identify each attachment vertex with a vertex of G^ of the same 
type. Moreover, we may pick these vertices independently and uniformly at 
random from the allowed vertices of G^; this is because all possible (labelled) 
graphs formed in this way have the same number of vertices of each type. 
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and the same number of edges between vertices of each pair of types, and 
hence the same probabihty in our model. 

We have shown above that whp our hst Tj contains two trees, say Ti 
and each of which has a vertex Vi at distance d from the corresponding 
attachment vertex Oj. By (|11.8|) . whp contains Q{n) vertices. Hence, by 
Theorem l^.l^t ii)} whp almost all pairs of vertices of are at distance at 
least d' = (1 — e) logn/log ||Tk||. Hence, whp the vertices of G^ at which 
we reattach the are at distance at least d' . Thus, whp, diam(G„) > 
d{vi,V2) > 2d + d' , completing the proof of Theorem 114. Ill □ 



Our main aim in this section is to prove Theorem 13. 171 which claims that 
if a kernel k on a ground space (5, ^) is irreducible and satisfies 1)3. 11() . i.e.. 



then the function c p{c) := p{ck) is analytic except at c = cq := HTkH"^, 
that has an eigenfunction ■0 of eigenvalue WT^^W, and that every such 
eigenfunction is bounded and satisfies (|3.12|) . 

Proof of Theorem Y3.n\ Note that our assumption (|15.1|) on k implies that 
is a Hilbert-Schmidt operator, and thus compact in L^; see Lemma 15.151 

It further implies, by the Cauchy-Schwarz inequality, that is bounded 
^ L°°, and that (f^ holds for all x. 

(i) It is trivial that the function c ^ p{c) = p{ck) is analytic for c < cq, 
so we shall assume that c > cq. We shall show that we can extend this 
function to a suitable neighbourhood of c in the complex plane, and that 
this extension is (complex) analytic; this implies that /o is a real analytic 
function at c. Actually, we will show that there is an analytic map z p'^ 
into L?'{p), defined in a neighbourhood of c, such that = p^n when z 
is real. Here, as before, pzn is the function defined by Pzk{x) = p{zK;x), 
the survival probability of the branching process Xzk{x), which starts with 
a particle of type x. We may then take p^{z) := / p^{x)dp{x) as the 
extension of p. 

To show that the claimed extension exists, we will use the implicit function 
theorem for complex analytic functions in the Banach space L2. Of course, 
in this proof we use the complex version of L^. Recall that a function 
/ mapping an open subset [/ of a complex Banach space E into another 
Banach space F is analytic if and only if it is differentiable (in the Frechet 
sense) at every point in U; the derivative f'{x) at a point x G U then is a 
continuous linear operator E ^ F, see, e.g., Herve |52[ Section 3.1]. For 
background on differentiable functions in (real or complex) Banach spaces, 
see Cartan |32j : in particular, as a special case of |32[ Theoreme 4.7.1] (which 
holds in both the real and complex cases), we have the following. 



15. The phase transition 




(15.1) 
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Lemma 15.1 (The implicit function theorem). Let B be a complex Banach 
space and let fo ^ B , zq £ C Let ^ : C x B ^ B be an analytic function, 
and denote by D2^{z,f) the partial derivative of ^ with respect to the sec- 
ond variable, i.e., let D2^{z,f) be the derivative of f ^ ^'(z,/). Suppose 
that ^(zo,fo) = and that D2'^{zo, fo) is invertible. Then there exists a 
neighbourhood U of zq and an analytic function z f{z) defined in U such 
that f{zo) = fo and ^{z, f{z)) = 0, z eU. □ 

For convenience, note that by replacing k by ck, we may assume that 
c = 1 (and thus cq < 1). We then apply Lemma 115.11 with B = L'^{p), 
zo = c = 1, /o = and ^-(2;, /) = $«(z/) - /, where ^>«,(/) = 1 - e'^'^f as 
above. Since T^ is a bounded linear map L^ L°°, and g is analytic 
L°° L°°, $ is analytic L°° C and thus ^ is analytic. It is also 

easily seen that D2^{z, f) = z^'^{zf) — /. It remains to show that that the 
partial derivative at (1,/Ok) is invertible; we state this as another lemma. 

Lemma 15.2. Assume that k is irreducible, 1)15. 1() holds and cq = ||Tk||^^ < 
1. Let p be the measure dp = e'^"^'^ dp on S. Then ||*I?k(Pk)IIl2(^) < 1, and 
hence -D2^(1)Pk) = ^k(Pk) — I is invertible in L'^{p) and in L'^{p). 

Proof. Since maps into L°°, we have T^/Ck G Hence L'^{p) = 

L'^{p) with equivalent norms. 
We have 

Kif)i9) = e-'^'^^T^g. (15.2) 
Hence, for any g,h £ L'^{p) = L^{p), writing h for the complex conjugate of 
h, we see that 

{'^'Mi9),h)LHf.) = l^e-^-'-{T^g)he^^f>^ dp = jjT^g)hdp 

is a hermitian form in g and /i, because is a symmetric operator in L'^{p). 
Hence $^(pk) is a symmetric operator in L'^{p). Furthermore, as remarked 
above, is compact in L'^{p) and thus also in L'^{p). 

If we had > !> there would thus be an eigenfunction g with 

an eigenvalue A with |A| > 1. Let h := \g\ > 0, so J h > 0. Since 'I>'^(/3k) has 
a non-negative kernel by 1)15. 2() . we obtain 

h < \Xg\ = \K{P>^)9\ < K{P^)h = e-'^-P-T^h. (15.3) 

Moreover, > a.e. and thus, by part (iii) of Lemma l5.6( T^p^ > a.e. 
and 

T^P. < e^^P^ - 1 = e^^P-^^p^ = e^^P-p^ a.e. (15.4) 
Multiplying 1)15. 3() and ()15.4|) and integrating, recalling that J h > 0, we 
obtain 

j hTi^Pf,dp< j p^T^hdp, 

which contradicts the symmetry of T^. 

This contradiction shows that ||^*K(pK)||L2(/i) < 1- Hence I — <I>^(pk) is 
invertible in L^ (p) = (p) , completing the proof of Lemma 115.21 □ 
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Continuing the proof of Theorem \'A.17\ we can now apply the imphcit 
function theorem (Lemma 115.1)1 to conclude the existence of an analytic 
function z i— > G -^^(a*) with <I>k(-Z/o^) = pf^ defined in a complex neigh- 
bourhood U of 1. We may further (by continuity) assume that U is so small 
that ||p^||2 > 0. For real z G U, we thus have ^znipt) = ^K{zpt) = pt 
(in L^, i.e., a.e.), so pf = pzK a-e. by Theorem 16.11 and Remark 15.51 which 
completes the proof of (i). 

(ii) This time we scale so that ||Tk|| = 1, and thus cq = 1, and write pi+g 
for we assume below that < e < 1. Thus, by Theorem 16. H 

1 _ e-(l+.)T.pi+. ^ ^.(i+,),(pi+,) = 



I.e. 



where 



(1 + e)T^pi+e = - ln(l - pi+e) = Pl+s + Ripi+e) (15.5) 

R{f):=^ + ^ + ... (15.6) 

By Theorem 16.11 pi+£ > a.e. when e > 0, but pi = 0. By Lemma 15.151 
there exists an eigenfunction £ L"^ with ^p = T^^^p, which now implies 

G L°°. Furthermore, ip is determined up to a constant factor, so the 
coefficient J tp f ip"^ / J ip'^ does not depend on the choice of ip. We will for 
convenience assume that ip is chosen with ip >Q and J ip'^ dfi = \\ip\\2 = 1- 

The operator maps the subspace ip-^ C L'^{p) into itself; let denote 
the restriction of to this subspace. Then 1 is not an eigenvalue of T^, and 
thus (since is compact), 1 does not belong to the spectrum of T^, i.e., 
I — Tp. is invertible. By continuity, I — (1 + e)T^ is also invertible for small 
e, and there exists 6 > and C < oo such that || (/ — (1 + e)?"^) < C 

for < e < 5, i.e., 

II/II2 < C\\ (/ - (1 + e)T«)/||2, 0<e<5, fG^P^. (15.7) 

Theorem l6.4l imDlies that pi+e \ Pi a.e. as e \ 0, and thus, by dominated 
convergence, 

\\pi+sh^O. (15.8) 

We also have, by 1)15. 5() . pi+s < (1 + £)TKPi+e and thus, as is bounded 
from L2 to L°°, 

||Pl+e||oo < (1 + s)\\TKPl+e\\oo < Ci II2. (15.9) 

In particular, by (|15.8j) . H/Oi+^Hoo ^ as e — > 0. 

Assume that e > is small enough to ensure that ||pi+e||oo < 1/2- Then, 
by ()15.6|1 . i?(pi+e) < /of+g, and thus, using ()15.9|) . 

\\R{pi+s)h < ll/^f+.lloo = \\pi+e\\lo < C2\\pi+e\\l (15.10) 

Let Q be the orthogonal projection onto ip-^ and let p1_^_^ := Qpi+g. We 
thus have the orthogonal decomposition 

Pl+e = aelp + Pl+s, (15.11) 
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where 



ae = {pi+e,i^) = J pi+ei'd^. (15.12) 

Hence < < ||/9i+e||2. 

Applying the projection Q to (|15.5|) we find, since QT^ = Tf^Q, 

(1 + e)T,pl+, = pI_^, + Q{R{pi+e)) 

and thus by 1)15. 7|) and ()15.1U|) . for e < 6, 

Wpl+eh < C\\{I - {l + e)T^)pl^,h = C\\Q{R{p^+M2 < Cs\\pi+e\\l 

(15.13) 

Consequently, by 1)15. 11() and ()15.8|) . as e ^ 0, 

as = Waetph = \\pi+e - Pi+sh 

= Wpl+eh + 0{\\pI^^\\2) = Wpi+eh + OiWpi+eW'i) ~ llPl+elb- 
Furthermore, recalling p5.12j) (twice), ip = T^Tp, and (|15.5|) . 
(l + e)a, = (l+e)(T«V,Pi+.) = (1 + e)T«/>i+,) 

= (V', Pl+e) + (V", R{Pl+e)) = + (V', R{Pl+e))- 

Therefore, appealing to (IKB . (115.111) . ([TK1I|) . (115.1311 and (115.1411 . we find 
that 

eae = {iP,R{pi+,)) = {'>P,\pI+,) + 0{\\pI+,\\oo) 

= \ali,^)) + aei^pl^,) + ii', Upl+ef) + OdIpi+.IlL) 



2^ „_ n (15.14) 



ia^ y" + 0(a,||pl+,||2) + Oi\\pl^,\\l) + 0(||pi+, 
ia^ ji;^dp + Oias\\pi+e\\l) + 0(||m.ll!) + 0(||/9i+, 



ellooJ 



,,3 ) 



ia^ / ?/;3(i/i + 0(a3). 



If e > is small enough then, by ()15.14() . we have > 0, and so we can 
conclude that 

e = la, j dp + 0(4). (15.15) 

Finally, let e \ 0. Then ()15.14p and ((13?^)) imply that ^ 0, and so 
from ()15.15() we see that e ~ f il)^ dp = ©(a^), and, more precisely, 

a, = j^^e + 0{e\ (15.16) 
Consequently, using (I15.11jl . (n5.i:-{|l . (115.1411 and (I15.1tijl . 

/3(l+e)= / pi+edp = ae / / Pl+eC?^ 

= aj ^pdp + 0{\\pUM = 2jPf^e + 0{e% 
proving (|3.12|1 , and so completing the proof of Theorem 13.171 □ 
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As noted in Subsection VAM Theorem K-i.171 has a simple consequence, 
Corollary 13.181 showing that the rate cop'j^{co) of emergence of the giant 
component at the phase transition is maximal in the Erdos-Renyi case, 
and, more generally, when (|4.2)) holds; see Example 14.61 

Proof of Corollarv \3.1tA Our aim is to show that if n is an irreducible kernel 
on a ground space {S,fi) for which ()15.1|) holds, and cq := ||Tk||~^ > 0, then 

cop'+{co) < 2. 

By Theorem 13.171 00/5+ (cq) = 2 J^ip /^V'^/ Is^'^- Lemma [5.151 we 
may assume that ip > 0. Then, by Holder's inequality, fgip < (/^ ip^) 

and Jgip"^ < ils^^) ' '^ith equality if and only if tp is a.e. constant, i.e., 
if and only if the constant function 1 is an eigenfunction of T^, which is 
equivalent to □ 

Turning to the number of edges at the phase transition, we shall next prove 
Proposition 1231 which says that if Kifi IS Si graphical sequence of kernels on a 
vertex space V with limit k, and ||Tk|| = 1, then ^e(G^(n,K„)) ^ ^ // k < 
1/2, with equality if and only if (|4.2|) holds. 

Proof of Proposition By Proposition 18.91 we have e(G^(n, k^)) /n 7, 
where 1 ■= ff i^- If II^kII = 1) then 

1 / / „ _ 1 /I 7^ -|\ < i||7^ II _ 1 



2 // ~ 2 \ ' / — 2ll-^K|l ~ 2' 

with equality if and only if the constant function 1 is an eigenfunction, i.e., 
if and only if holds. □ 

Remark 15.3. Proposition 13.41 savs that the number of edges at the phase 
transition is largest in the classical Erdos-Renyi case, and is strictly smaller 
in all other cases except some very homogeneous ones. Together, Corol- 
lary 13.181 and Proposition 13.41 say roughly that inhomogeneities make the 
giant component appear sooner, but grow more slowly (initially, at least). 

Remark 15.4. Another way to study the number of edges when the giant 
component is born is to consider the graph process in Remark 12.61 Let us 
stop the growth when the largest component first has at least Lv{n) vertices, 
where u;(n) is a function chosen in advance, with uj[n) = o{n), and u){n) 
increasing sufficiently rapidly with n. (If k is bounded, we can take any oj 
with logn <C uj{n) <C n, see Theorem 13.121 ) Then, for any e > 0, whp we 
stop at a time between (cq — e)/n and (cq -|-e)/n, where cq := ||Tk||~"'^, and it 
follows easily from Proposition 18. 91 that if N is the number of edges when we 
stop, then N/n ^ // k. Again we see that the number of edges required 
for a giant component is largest in the homogeneous case. 

Remark 15.5. The proof of part (i) of Theorem 13.171 shows that the func- 
tion p{cK,; ■) defined before 1)2. 13() depends analytically on c / cq as an ele- 
ment of L^. Hence in Theorem 19. 101 also depends analytically on c 7^ cq. 
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We expect the following extensions of Theorem 111 171 to hold. 

Conjecture 15.6. Theorem \3. IT^ i) holds without the condition p.llf) . i.e., 

p{c) is always analytic except at cq. 

Conjecture 15.7. Let k he an irreducible kernel on a ground space 
Then equation ()3.12() holds with the larger error term o[e) whenever 
has an eigenf unction Tp of eigenvalue \\Tf^\\ with Jgip"^ < oo; conversely, 
P+{co) = i/ Co > but no such ifj exists or ip exists with Jgip"^ = oo. Cf. 
Subsection \16.4\ 

16. Applications and relationship to earlier results 

In this section we apply our general results to several specific models that 
have been studied in recent years, and describe the relationships between 
our results and various earlier results. 

16.1. Dubins' model. A common setting is the following: the vertex space 
V is (x„)„>i), where S = (0, 1], p is the Lebesgue measure, and x„ = 

(xi, . . . .,Xn) with Xi = i/n. In this case, (|2.;-i|) gives pij = K{i/n.,j/n)/n A 1 
for the probability of an edge between vertices i and j. We shall consider 
several choices of k in some detail. 

Observe first that if k is a positive function on (0, oo)^ that is homogeneous 
of degree —1, then (|2.,3|) yields pij = K{i,j) A 1. Since this does not depend 
on n, in this case we can also consider the infinite graph G{oo,k), defined 
in the same way as G„ = G^(n, k) but on the vertex set {1,2,...}. Note 
that the graphs G^(n, k) are induced subgraphs of G(oo, k) and that we can 
construct them by successively adding new vertices, and for each new vertex 
an appropriate random set of edges to earlier vertices. 

We first consider k{x, y) = c/{x V y) with c > 0, so that if j > c then 

Pij = c/j for i < j. (16.1) 

In this case we can regard G^(n, k) as a sequence of graphs grown by adding 
new vertices one at a time where, when vertex k is added, it gets Bi(A;— 1, c/k) 
edges, whose other endpoints are chosen uniformly among the other vertices. 
(We might instead take Po(c) A (fe — 1) new edges, without any difference in 
the asymptotic results below.) 

This infinite graph G{oo, k) was considered by Dubins in 1984, who asked 
when G(oo,k) is a.s. connected. Dubins' question was answered partially 
by Kalikow and Weiss [HO]. A little later Shepp [Sni proved that G(oo,k) is 
a.s. connected if and only if c > 1/4. This result was generalized to more 
general homogeneous kernels by Durrett and Kesten j46j . 

The finite random graph G^(n, k) with this k, i.e., with edge probabilities 
given by (|16.1|) . has been studied by Durrett who points out that it 
has the same critical value c = 1/4 for the emergence of a giant component 
as the infinite version has for connectedness, and by Bollobas, Janson and 
Riordan who rigorously show that this example has a phase transition 
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with infinite exponent. More precisely, denoting p{k,) by p{c), it was shown 
by Riordan [HH that 

/9(l/4 + e) =exp(-§e-i/2^0(loge)). (16.2) 

A similar formula for the closely related CHKNS model (see Subsec- 
tion ll6.^^ . introduced by Callaway, Hopcroft, Kleinberg, Newman and Stro- 
gatz |30| . had been given earlier by Dorogovtsev, Mendes and Samukhin 
using non-rigorous methods. 

To find the critical value by our methods, we have to find the norm of on 
L2(0, 1). Using the isometry U : f ^ e-^/V(e"''') of L'^{0, 1) onto L2(0, oo), 
we may instead consider Tf^ := UT^U~^, which by a simple calculation is 
the integral operator on L^(0, oo) with kernel 

k{x, y) = e-^/2^(e-^ e-y)e-y/^ = ce-^/2-y/2+xAy ^ ce'l^-^l/^. (16.3) 

Hence is the restriction to (0, oo) of the convolution with h{x) := ce"'^'/^. 
Because of translation invariance, it is easily seen that has the same norm 
as convolution with h on L^(— oo,oo), and taking the Fourier transform we 
find 

/oo 
h{x) dx = 4c. 
-oo 

Thus, Theorem 13.11 shows that there is a giant component if and only if 
c > 1/4, as shown in Durrett fllj and |iy|. 

To find the size of the giant component is more challenging, and we refer 
to Riordan |H11 for a proof of ifTH^ . Note that the hypothesis ^TT^i of 
Theorem 13.171 fails, as do the conclusions in part (ii). Indeed, it is easy to 
see that is a non-compact operator, and that it has no eigenfunctions 
at all in L^. We suspect that this is connected to the fact that the phase 
transition has infinite exponent. 

16.2. The mean-field scale-free model. Another interesting case with a 
homogeneous kernel as in Subsection 116.11 is K{x,y) = cj ^Jxy with c > 0; 
then, for ij > c^, we have 

Vij = c/ ^/ij- (16.4) 

This model has been studied in detail by Riordan |84j. Considering the 
sequence G^(n, k) as a growing graph, in this case, together with each new 
vertex we add a number of edges that has approximately a Poisson Po(2c) 
distribution; the other endpoint of each edge is chosen with probability 
proportional to which is approximately proportional to the degree 

of vertex i. Hence, this random graph model resembles the growth with 
preferential attachment model of Barabasi and Albert [2], which was made 
precise as the LCD model by Bollobas and Riordan 1^; see also 
fact, up to a factor of 1 + o{i~^) in the edge probabilities, the model defined 
by (|16.4jl is the so called 'mean-field' version of the Barabasi-Albert model, 
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having the same individual edge probabihties, but with edges present inde- 
pendently. This (by now common) use of 'mean-field' is not the standard 
one in physics, where it normally means that all vertices interact equally. 
(So the mean-field random graph model is G{n,p).) 

In this case, is an unbounded operator, because ^^(0, 1), and 

thus there is no threshold. In other words, p{c) := p(/t) > for every c > 0. 

As shown by Riordan [Slj, p{c) grows very slowly at first in this case too; 
more precisely, 

p{c) ~ 2e^-^ exp(-l/(2c)) as c ^ 0, (16.5) 

where 7 is Euler's constant; see also Subsection 116.41 below. The result in 
|84j for the Barabasi-Albert model is different, showing that in this model 
the dependence between edges is important. 

Remark 16.1. Random graphs related to the ones defined here and in 
Subsection 116.11 but with some dependence between edges (and thus not 
covered by the present paper) can be obtained by adding at each new vertex a 
number of edges with some other distribution, for example Bi(m,p) for some 
fixed m and p. Such random graphs have been considered in I 22| 1251 1411 184j , 
and these papers show that not only the expected numbers of edges added 
at each step are important, but also the variances; the edge dependencies 
shift the threshold. 

16.3. The CHKNS model. We next consider the CHKNS model of Call- 
away, Hopcroft, Kleinberg, Newman and Strogatz jHOj- Here, the graph 
grows from a single vertex; vertices are added one by one, and after each 
vertex is added, an edge is added with probability 6; the endpoints are cho- 
sen uniformly among all existing vertices. (Multiple edges are allowed; this 
does not matter for the asymptotics.) 

Following Durrett we consider a modification (which is perhaps at 
least as natural): after adding each vertex, add a Poisson Po((^) number of 
edges to the graph, again choosing the endpoints of these edges uniformly 
at random. Thus, when vertex k is added, each existing pair of vertices 
acquires Po(^6/(^^^ new edges, and these numbers are independent. When 
we have reached n vertices, the number of edges between vertices i and j, 
with 1 < i < J < n, is thus Poisson with mean 

and the probability that there is one or more edges between i and j is 
Pij := 1 - exp(-eij). 
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Hence, ignoring multiple edges, we have a graph G„ of our type, with 
5 = (0, 1], /i Lebesgue measure, Xi = i/n and 

'^-(^'^^ ■= - ^"PH( n(xv'y)-l -\))) 

^K{x,y):= 25(^-1). (16.7) 

The conditions of Theorem 13.11 are immediately verified, and thus 
Ci{Gn)/n^ p{k). 

Instead of adding a Poisson number of edges at each step, we could add 
a binomial number by adding, after vertex k, each possible edge with prob- 
ability (5/ (2)- We obtain the same results with slightly different k„ but the 
same k. _ 

The original CHKNS model, Gn, say, can be treated by the argument in 
[ini. It follows that Ci(G„)/n ^ p{k) holds for the CHKNS model too. 

In particular, the threshold for the CHKNS model, as well as for Durrett's 
modification, is given by \\T^\\ = 1, or 25 = ||T||^^, where T is the integral 
operator with kernel l/(xV?/) — 1 on L^(0, 1). This kernel is strictly smaller 
that the kernel l/(x V y) considered in Subsection 116.11 However, chang- 
ing variables as in 1)16. we see that T is equivalent to the operator on 
L^(0, 00) with kernel e"'^"^'/^ — e~*^^^^^/^. Using translational invariance of 
the operator with kernel e"'^"^'/^ considered in Subsection ll6.lL considering 
functions supported in (ii, 00) and letting i? — > 00, it is easily seen that T 
has the same norm as this operator, namely 4. 

Thus the thresholds for the CHKNS model and Durrett's modification 
are both given by 25 = 1/4, i.e. 5 = 1/8, as was found by non-rigorous 
arguments by Callaway, Hopcroft, Kleinberg, Newman and Strogatz |30j 
and Dorogovtsev, Mendes and Samukhin |43j . and first proved rigorously by 
Durrett (441; see also [T9] . 

To study the size of the giant component in these models, let us write 
no{x^ y) := l/{xV y) and Ki{x,y) := l/(x V y) — 1. Then ki < kq, and thus 
p(cKi) < p{ckq) for each c > 0. (We have strict inequality for c > 1/4, see 
Remark l3.8| note that, as pointed out by Durrett we have the same 
threshold 1/4 for both kernels although we have twice as many edges in 
G(n, ckq) as in G{n,cKi).) On the other hand, let ry > and consider only 
vertices i < j < rjn. Then 

n CKiii n,j n) = c > . 

\j nJ J 

Hence, cf. (|16.1|) . G(n, cki) 13 G{r]n,{l — r])cKQ). (Note that these graphs 
have different numbers of vertices.) Thus, for every rj with < r/ < 1, 

p(cko) > p{cKi) > 7]p{{l - rj)cKo). 

Taking c = 1/4 + e and rj = e^, relation (|16.2|) for p{cKo) implies the same 
estimate for p{cKi); in other words, if 6 = 1/8 + e, then the size of the giant 
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component is given by 



p{k) = p(2(5ki) = exp 



2V2 



vr 



£ 



/2 + 0(loge)) 



As noted in Subsection ll6.H a similar formula (with no error term, and a 
particular constant in front of the exponential) was given by Dorogovtsev, 
Mendes and Samukhin with a derivation part of which can be made 
rigorous; see Durrett [Hj . 

16.4. The rank 1 case. In this subsection we consider a special case of 
our general model that, while very restrictive, is also very natural, and 
includes or is closely related to many random graph models considered by 
other authors. This is the rank 1 case, where the kernel k has the form 
k(x, y) = tp{x)tp{y) for some function -0 > on 5. We shall assume that the 
kernel is graphical; in particular we assume J tp dfi < 00, but not necessarily 
that J tp'^ dfi < 00. 

The function tp{x) can be interpreted as the "activity" of a vertex at x, 
with the probability of an edge between two vertices proportional to the 
product of their activities. In the rank 1 case, T^/ = (/ fip)ip, so 



Thus is bounded if and only if ^ G L^, in which case has rank 1, 
so it is compact, and ^p is the unique (up to multiplication by constants) 
eigenf unction with non-zero eigenvalue. 

By Theorem 13.131 the distribution of vertex degrees is governed by the 
distribution of the function A(x) = (J tp diJ,)'ijj{x) on (5,/x). In particular, by 
Corollarv ll3.11 the degree sequence will (asymptotically) have a power-law 
tail if the distribution of A(x) has; for example, if 5 = (0, 1] with fi Lebesgue 
measure, and 'ip{x) = cx~^^p. (Another, perhaps more canonical, version is 
to take ^p{x) = x on S = [0, 00), with a suitable finite Borel measure /i. 
Note that every random graph considered in this example may be defined 
in this way, since we may map S to [0, 00) by x 1— > ip{x). Alternatively, we 
may map by x 1— > A(x) and have ip{x) = cx with c > and A(x) = x.) 

Random graphs of this type have been studied in several papers; we shall 
not attempt a complete list, mentioning only several examples. Chung and 
Lu |34j and Norros and Reittu 82 give results on the existence and size 
of a giant component. Britton, Deijfen and Martin-Lof |22j use (|2.7|) with 
^(a:;, y) = ip{x)ip{y) to define a random graph, and observe that conditioned 
on the vertex degrees, the resulting graph is uniformly distributed over all 
graphs with the given degree sequence; they further prove a version of The- 
orem E^Sl for this case. 

Actually, in [H^ and [H2] the edge probabilities pij are given by pij := 
WiWj/Y^'^=iWi^ with Wi deterministic in and random in |H2j. Under 
suitable conditions on the wi, these examples are also special cases of our 




(16.8) 
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general model. For suitable deterministic sequences {wi)i, we use Defini- 
tion [221 we omit the details. For random i.i.d. Wi, as in [HS!, if we further 
assume Kwi = uj < oo, we can, for example, let S = [0, oo), /_i = C{wi), 

Xi = Wii^jWj /nuS) and ^{x) = lo'^I'^x. Then K{x,y) = xy/oj, and 
we have A(x) = x in Theorem 13.131 and thus A = in Corollary 113.11 
and H ~ Fo^wi). Furthermore, from (|16.8|) the norm of is (essentially) 
the 'second order average degree' d = Yl'^f / Thus, for example, 

the result of Chung and Lu 1351 136j that, under certain assumptions, the 
typical distance between two vertices of the model G(w) studied in 
is logn/log((i) corresponds to Theorem 13.141 (Chung and Lu also study 
sequences Wi falling outside the scope of our model.) 

Chung and Lu ^37^ give a result for the 'volume' X^igCi 
ant component Ci of G(w). This result corresponds to Theorem 19.101 with 
f{x) = A(x); indeed, under certain assumptions on the Wi, it is implied by 
Theorem 19.101 Unfortunately, the statement of the result in 37 is incom- 
plete, as no conditions on the Wi are given. It is not clear what the right 
conditions are; certainly some restrictions are needed. 

The random graphs G(n, k) obtained from rank 1 kernels should be com- 
pared to the random graphs with a given (suitably chosen) degree sequence 
(dj)", studied by, for example, Luczak [72], Molloy and Reed [771 [TS] and (in 
the power-law case) Aiello, Chung and Lu Note that in this model, the 
probability of an edge between i and j is roughly didj/n, but there are de- 
pendencies between the edges. It was shown by Molloy and Reed [77j that, 
under some conditions, the threshold for the existence of a giant component 
in this model is 'Yli^iidi — 2) = 0. This fits well with our result, although 
we see no strict implication: Theorem 13. 131 shows that, for our model, the 
degree of a random vertex converges in distribution to a random variable H 
with the mixed Poisson distribution J"^ Po(A(x)) (i/i(x). If X ~ Po(A), then 
E(X(X - 2)) = E(X(X - 1) - X) = A^ - A, so 

E(H(H - 2)) = ^ (A(x)2 - A(x)) d^i{x) = (^Jj dfj)j ' d/^ - l) , 

which vanishes when f^tp'^ = 1. As ||Tk|| = J^tp'^, this is indeed the thresh- 
old for the emergence of a giant component in our model. The result of 
Molloy and Reed ^77. that, in the supercritical case, the second largest com- 
ponent has size O(logn) corresponds to Theorem 13.1 ^1 ii j| again there is no 
strict implication, but the kernels k corresponding to the graphs studied by 
Molloy and Reed satisfy inf k(x, y) > 0. In a subsequent paper, Molloy and 
Reed 78 gave further results on the size of the giant component and on the 
structure of the remainder of the graph, corresponding to our Theorems 13. 11 
and inn 

The following variant of this model has also been studied: the degrees are 
first chosen according to some distribution, and then the graph is chosen 
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uniformly among all graphs with the resulting degree sequence; see, for ex- 
ample, the results of Van der Hofstad, Hooghiemstra and Van Mieghem |53j 
and of Fernholz and Ramachandran on distances and diameter, respec- 
tively, mentioned in Section ITU 

Yet another variant of the rank 1 case of G^(n, k) was studied rather 
earlier by Khokhlov and Kolchin |641 I65j . who proved results about the 
number of cycles; see Section ITTI 

In the rank 1 case, the size of the giant component (if any) of G^(n, k) can 
be found rather easily. In order to study the phase transition, let us consider 
the kernel CK{x,y) = cip{x)ip{y), with c > a parameter. By Corollarv 13.21 

the threshold for c is cq = ||Tk||^"'^ ~ ( ) • ^ — "^O; let 



a(c) '■= c J ippcndfi, (16.9) 

where, as before, Pck{x) = picK; x) is the survival probability of the branch- 
ing process Xenix)- 

We have TckPck = cTi^pcn = a{c)ip. Thus, by Theorem 16. II and 1)16. 9() . 

Pc. = = 1 - e-^^-P^'^ = 1 - e-"(^)^. (16.10) 

(The condition 1)5.11) holds for every x.) Let 



/?(t) :=y (^1 -e-*^(^))V(2;)d/i(x), t>0. (16.11) 
Then, by (tTHll and (|16.1()l) . 

Pc^il^dp = cP{a{c)), (16.12) 



Q C = C 



5 



SO c = q(c)//3(q(c)) , i.e., a is the inverse function to t i— > ^(t) := t/[3{t). 
Since (3 is explicitly given by 1)16. llj) . for any c > cq this gives (at least 
in principle) a(c), and hence, by ()16.1U|) . the function pcn- Then p{cti) = 
j's PcK.{x) dp{x) determines the asymptotic number of vertices in the giant 
component. Similarly, by Theorem 13. 5| the asymptotic number of edges in 
the giant component is determined by C{ck), which by the definition (|3.3|) 

and mrni is given by 

({ck) = c J i^Pctidp j iidp-^l^j iipcndp^ =a{c) j il^dp-^^^^. 

(16.13) 

Moreover, the asymptotic value of Yli^Ci fi^i)/^ given by Theorem 19.101 
for suitable functions /. 

We now turn to asymptotics as c \ cq, in order to study the phase 
transition more closely. Note that pcn \ a.e. as c \ cq by Theorem 16.41 
so, by dominated convergence, 

a{c)/c\0 as c \ cq. (16.14) 
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Further, by and dominated convergence, 

44= /" M^Mx)^ I i^{x) dfi{x) > asc\co. (16.15) 
«(c) Js a(c) Js 

Consequently, the behaviour of /? at determines, through 7 and a = 7""^, 
the behaviour of p{c) as c \ cq. Note that, by (|16.11() . (3 is continuous with 
/3(0) = and 

P'{t) = J e-''^^''^\x)dfi{x), t>0. (16.16) 

Moreover, from (|16.13|) we have 

C{ck) = a{c) / il^dfi — ~ a(c) / ip d^i ^ p{c) 

Js Js 

as c \ Co, where the first ~ is from ()16.14() and the second from (|16.15|) . 
Hence the asymptotics are the same as for the number of vertices; see Re- 
mark E^Sl 

Let us consider some concrete examples. Once again, we take 5 = (0, 1] 
with n Lebesgue measure, and let il){x) = x~^^^ where 1 < p < 00. We shall 
use C, Ci, etc. to denote various positive constants that depend on p. 

Case 1: 1 < p < 2. In this case, HV'lb = 00, so cq = 0. As t — > 0, by 

mm, 

f3'{t):= e-*^"'^'x-2/P(ix=p / e'H~^+Py^-Pdyr.CtP-^, (16.17) 
Jo Jt 

noting for the last step that the integral /q°° e~^y^^*' dy is convergent. Thus 
I3{t) ~ CitP-^ and -f{t) = t/f3{t) ~ C2t^-P. Consequently, using (TTbT^Ii . 

p{c) ~ Cza{c) = C37"^(c) ~ C4C^/(2-P) as c ^ 0. 

Note that this exponent 1/(2 — p) may be any real number in (l,oo). 

Case 2: p = 2. This is the case (|16.4|) studied in Subsection 116.21 and 
[Hlj . We still have HV'lb = 00 and thus cq = 0. In analogy with (|16.17|) we 
now find that I3'{t) ~ 21n(l/t) as t ^ 0. This yields I3{t) ~ 2tln(l/t) and 
7 ~ l/(21n(l/t)) as i ^ 0, and thus a(c) = 7"Hc) = e'^^+^^^^^/^c 

p(c) = e-(i+°(i))/2'= as c ^ 0. 

More refined estimates can be obtained in the same way, see ()16.5|) and |84j . 

Case 3: 2 < p < 3. For p > 2 we have f ip'^ dp < 00, and thus cq > 0, 
so we have a phase transition. (In fact, cq = 1 — 2/p.) By (|16.16|) . (3'{t) is 
continuous for t>0 with /3'(0) = J ip"^ dp = Cq^. Differentiating once more 
we obtain as t ^ 

rl poo 

j3"{t) =- e-^^'^'^'x-^/P dp{x) = -p e-H-^+Py'^-P dy ^ -CtP-^ 
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and thus /3'(t) = Cg ^ - (Ci + o{l))tP~^ and (3{t) = CqH - {C2 + o{l))tP-\ 
Hence -f{t) = t/P{t) = cq + (C3 + 0(1))*^"^. Consequently, using (|16.15j) . 

p(co + e) ~ C4a(co + e) ~ Cge^Cp-s) as e \ 0. 

We thus have a phase transition at co with exponent l/{p—2). Note that this 
exponent may be any real number in (1,cxd). (Taking instead e.g. ^l^{x) = 
x~^/'^ln~^(e^ /x), it is similarly seen that there is a phase transition with 
infinite exponent.) 

Case 4- p = 3. Similar calculations show that, as t 0, (5"{t) ~ 31nt, 
p'{t) = 3 - (3 + o(l))t ln(l/t), P{t) =3t- (3/2 + 0(1))*^ ln{l/t), and j{t) = 
1/3+ (1/6 + o(l))tlnl/t. Consequently, with cq = 1/3, 

p{co + e) ~ Ca{cQ + e) ~ Cis/ ln(l/e) as e \ 0, 

so p'(co) = 0. 

Case 5: 3 < p < 00. In this case, / d/i < 00 and we find as t ^ 0, 
f3"it) ~ -C, /5'(t) = Co ^ -{C + o(l))t, /3(t) = Co - (C + o(l))tV2, and 
7(0 = Co + (Ci + o(l))t. Consequently, /j(co + e) ~ C2a(co + e) ~ Cse, so we 
have a phase transition with exponent 1. This is similar to Theorem 13.171 
although 1)3. lip is not satisfied (except in the classical case p = 00). Indeed, 
it can be checked that 1)3.12(1 holds, except that the error term may be larger 
(it is e(eP-2) for 3 <p < 4). 

More generally, the same argument shows that (|3.12() holds for any rank 
1 kernel ijj{x)ip{y) with J ip"^ dfi < 00, provided the error term is weakened 
to o(e). (The error term is O(e^) if / tp^ dfi < 00.) 

16.5. Turova's model. Turova [03 IHS IHS IHOj has studied a dynamical 
random graph G{t), t > 0, defined as follows, using three parameters 7 > 0, 
A > and 5 > 0. The graph starts with a single vertex at time t = 0. Each 
existing vertex produces new, initially isolated, vertices according to a Pois- 
son process with intensity 7. As soon as there are at least two vertices, each 
vertex sends out edges according to another Poisson process with intensity 
A; the other endpoint is chosen uniformly among all other existing vertices. 
(Multiple edges are allowed, but this makes little difference.) Vertices live 
for ever, but edges die with intensity 6, i.e., the lifetime of an edge has an 
exponential distribution with mean 1/6. (All these random processes and 
variables are independent. We use 5 for Turova's /i to avoid confiicts with 
our notation.) 

By homogeneity we may assume 7 = 1; the general case follows by re- 
placing A and 6 by A/7 and 6/j and changing the time scale. 

Our analysis of the random graph G{t) is very similar to that of Soderberg 
[88j : our theorems enable us to add technical rigour to his calculations. 
The vertices proliferate according to a Yule process (binary fission process): 
writing N{t) for the number of vertices at time t, the probability that a new 
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vertex is added in the infinitesimal time interval [t, t + dt] is N{t) dt. It is 
well-known (see, e.g., Athreya and Ney ^8i, Theorems III. 7. 1-2]) that 

e'^N{t) "-^W as t ^ oo (16.18) 

for a random variable W with W > a.s. (In fact, W ~ Exp(l), but we do 
not need this.) 

We condition on the vertex process, and assume, as we may by (|16.18|) . 
that 

e"*iV(t) -^w as t ^ oo (16.19) 

for some w > 0. We take S = [0, oo) and let xi, . . . , X7v(t) be the ages of the 
particles existing at time t. For any fixed s > 0, by 1)16. 19(1 we have 

Ms, oo) := ■Xi>s} = Jf^^i* -s) ^ e-\ 

as t — > oo. This means (see Remark IA.3|) that ut where /i is the 

measure on [0,oo) given by dfi/dx = (the exponential distribution). 

If Xi < Xj, the number of edges at time t between two vertices of ages xi 
and Xj has a Poisson distribution with mean 

i-x. N{s)-1 Jo N{t-s)-l 



Set 



and 



K*{x,y) := 2A C"^^ e-^'—^^^ -ds, 

*^ ' N{t-s)-l 







Kt{x,y) = N{t){l - eM-4{x,y)/N{t))). 

Thus s-ij — K-Kxi, Xj)/N(t), and the probability that there is at least one 
edge between i and j is given by pij = 1 — e~^'i = Kt{xi,Xj)/N{t). 

By ()16.19|1 . N{t)/ [N{t—s) — l) — > e'^ as t — > oo for every s, and dominated 
convergence shows that ii 5 ^ 1 and xt ^ x, yt ^ y, then 

Kl{xt,yt)^Ks{x,y):=2X / e-''+' ds = -(e'~^-')(-^y) -l) (16.20) 



For (5 = 1, corresponding to (5 = 7 in the non-rescaled model, let ki (x, y) := 
2X{x Ay). Then K^{xt,yt) Ks{x,y) in this case also. 

Theorem 13.11 thus applies to G{t) conditioned on the process {N(t))t>o, 
and we find (conditioned on {N{t))t>o, and thus also unconditionally) that 

CijGjt)) p 

with Ks given by 1)16. 2U() . 

To study p{ks) further, and in particular to investigate the threshold as we 
vary A keeping fj, >0 fixed, we thus have to investigate the integral operator 
with kernel ks given by (|16.2n|l . The change of variables x — > 



122 



BELA BOLLOBAS, SVANTE JANSON, AND OLIVER RIORDAN 



transforms S and fj, to the standard setting (0, 1] with Lebesgue measure, 
and the kernel (|1(i.2()l) b ecomes 

2A 



Hsix, y) := 3^ [{x V y)'-' - l) , (16.21) 



with Ki := 2Aln(l/(x V y)). 

In the case 6 = 0, this is the same as 1)16. 7() : hence we have the same critical 
value 1/8 (for A) as for the CHKNS model and the same p{k) giving the 
size of the giant component; in particular, the phase transition has infinite 
exponent. (Indeed, with 6 = the model is very similar to (Durrett's 
form of) the CHKNS model discussed in Subsection 116. HI now a geometric 
number of edges between random vertices is added at each step, rather 
than a Poisson number.) For 6 > 0, the kernel ks is in L^((0, 1]^), so T-, 



KS 



is compact (see Lemma I5.15|) and its norm can be found by finding its 
eigenvalues. By the discussion in Subsection 116.61 below, this is equivalent 
to solving (|16.23|) with the given boundary values. In our case, denoting the 
eigenvalue by a, this means solving aG"{x) = —2Xx^~'^G{x) with boundary 
values G{0) = G'(l) = (since (/>(!) = 0). 

The general solution is easily written down as a linear combination of two 
hypergeometric series, and G(0) = yields (up to a constant factor) 



9ix) = G'ix) = J2 



1 / 2A , 



nlT(n + l/6)\ a5^ 



X 



2A A -{l/5-l)/2 / / 2A A 1/2 

where is a Bessel function. 

The condition g{\) = G'{1) = (which gives the formula in Turova |931 

first positive zero of J^, then 



Corollary 4.1] and (Hi) thus leads to J^/s^i ( I ) ) = 0, so if Zy is the 



8A 



\\Tk,s\\ — \\Tks\\ — "1 — ^2^2 

" n/<5-i 

In other words, the critical value of A is Acr(^) = 6"^ Zy^_-^/S, as given by a 
related argument by Soderberg 

Theorem 13.171 applies only when 6 > 1/2, but the eigenfunctions are 
continuous and bounded for every 6 > 0, and we believe that the phase 
transition has exponent 1, and that (|3.12|) holds, for every 6 > 0. 

We can easily find the asymptotics of Xcr{6) as 5 — > or cxd; see Turova 
|93] . If A,(5 — > oo with X/6 — > c > 0, then Ks{x,y) — > 2c, pointwise and 
in L2((0, 1]2), and thus ||Ts, - Tadl < WTa, - T2c\\hs ^ 0. It follows that 
for large 6, the graph is subcritical if 2c < I and supercritical if 2c > 1. In 
other words, Xct/6 — > 1/2 as 6 ^ oo. Similarly, if 5 \ 0, then ks y kq 
and it follows easily, e.g. by Theorem 16. 4( that {{Tf^gW — > II^kqII, and thus 
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Acr(^) — > Acr(O) = 1/8. (The contrary assertion in [HSl is incorrect; see the 
erratum.) 

16.6. Functions of max{x,y}. In several of the examples above (see Sub- 
sections ESI and ll6.5|l we have 5 = (0, 1], ;U is the Lebesgue measure 
and n{x, y) = (j){x V y) for some function > on (0, 1]. The integral oper- 
ators Tk with such kernels have been studied by Maz'ya and Verbitsky jjl] 
and Aleksandrov, Janson, Peller and Rochberg pi. In particular, these pa- 
pers prove that is bounded if and only if supj.>o x 0(y)^ dy < oo, and 
that is compact if and only if x 4>{y)'^ ^ as x ^ 0. 

In the case when (j) is decreasing (as in the examples above), these criteria 
simplify to (t){x) = 0{x~^) and 4){x) = o{x~^) as x — > 0, respectively. 

Unfortunately, there is no general formula known for the norm of T^. 
(However, the criteria just given extend to estimates within constant factors; 
for example, if (j) is decreasing, then sup(x(/)(x)) < \\Ti^\\ < 4sup(x(/)(x)).) 
In the compact case, at least if <j) has a continuous derivative on (0, 1], the 
eigenvalues, and thus the norm, can be found by studying a Sturm-Liouville 
equation. In fact, g is an eigenfunction with eigenvalue A if 

Xg{x)=(^{x) g{y)dy+ (t>{y)g{y)dy. (16.22) 

Jo Jx 

If A 7^ 0, it is easily seen that then g G C^(0, 1] and, by differentiating, that 
(|16.22j) is equivalent to 

g{x) = G'{x), XG"{x) = (t>'{x)G{x), (16.23) 

with the boundary conditions G(0) = 0, G'(l) = A-V(1)G'(1) ; see i6,. Sec- 
tion 9] and the example in Subsection 116.51 above. 

17. Paths and cycles 

Let i-fc(G) and Qk{G) be the numbers of paths and cycles, respectively, 
of length k (i.e., with k edges) in a graph G. Note that Pi{G) = e{G) is 
the number of edges, and that Qi = Q2 = ^ for simple graphs. (If we allow 
multiple edges and loops as in Remark 12. 5( the results below extend to Q2 
and, under an additional continuity assumption on k, to Qi-) 

In this section we briefly study the numbers Pk = Pk{Gn) and Qk = 
Qk{Gn), where G„ = G^(n, k). The results are easily extended to a sequence 
Kn as in Definition 12 . 91 under appropriate conditions, but we leave the details 
to the reader. 

For A; > 1 let 

"fc(^) := / K{xo,xi)K{xi,X2)---K{xk-i,Xk)dfi{xo)--- dfl{Xk), 
2 Jsk+i 



Pk{K) := TTT / l^{xi,X2) ■ ■ ■ K{xk-l,Xk)K{xk,Xi) dfi{xi) ■ ■ ■ dfi{xk). 
2k .Igk 
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Note that ak{K) = ^(1,T^1). Clearly, ak{K) and Pk{n) may be infinite. 
In this case, the limiting statements in the result below have their natural 
interpretations. 

Theorem 17.1. Let k be an a.e. continuous kernel on a (generalized) vertex 
space V, and let Gn = G^(n, k). 

(i) For k fixed, 

liminf EPfc(G„)/n > ak{K.), k>l, 

n^oo 

liminfEQfc(G„) > A; > 3. 

n^oo 

(ii) Suppose further that V is a vertex space. If k is bounded, or if 
xi, . . . ,Xn are i.i.d. random points with the distribution fi, then 

EPk{Gn)/n^ak{K), k > 1, (17.1) 

IEQfc(G„) A;>3. (17.2) 

Moreover, whenever ()17.1|) holds and ak{K,) is finite, 

Pu{Gn)/n^ak{n). (17.3) 
Similarly, whenever (|17.2j) holds and the Pki^) are finite, 

Qk{Gn)^Po{^k{K)), k>3, (17.4) 
jointly for all k > 3 with independent limits. 

Proof. The argument for parts (i) and (ii) is as in the proof of Lemma 18.11 
(a special case), considering first the regular finitary case and then approxi- 
mating with K~ and k+ defined in 1)7. 2(1 and (|7.3() : we omit the details. The 
case of i.i.d. Xi with the distribution /j, is immediate. 

The convergence p7.3() and the asymptotic (joint) Poisson distribution 
(|17.4() of Qk{Gn) follow easily in the regular finitary case (first conditioning 
on x„ as in Remark 18.81 if V is a generalized vertex space), for example 
by the method of moments as for G{n,p), cf. |16l I59j . The general cases 
then follow by appealing to Billingsley I13J Theorem 4.2], noting that if 
K~ is defined by (|7.2|) . then afc(K^J oikin) and (3k{f^m) ~^ Pk{i^) by the 
monotone convergence theorem, while, from the assumption (|17.2j) and part 
(i) (applied to k~), 

limsupE|Qfc(G(n,K)) - (5A..(G(n, k~ )) | 

n— >oo 

= lim EQfe(G(n, k)) - liminf E(5fc(G(n, k;;)) < Pki^) - Pkin'^) ^ 
as m — > oo; an analogous bound holds for Pk/n. □ 

Part (ii) holds for many generalized vertex spaces too, but not for all. In- 
deed, Remark 18.21 extends easily to the present situation, although, writing 
Vn for the number of vertices of G^ {n, k), the variance condition Var(t;„/n) — > 
should be replaced by a higher moment condition E(t;^/n^) — > /u(5)^, with 
£ = k + 1 for (|17.1j) and £ = k for (|17.2j) . It is easily seen that (fTTT|l and 
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(|17.2|) hold in the situation in Example 18.51 too. However, these relations 
may fail for the counterexample in Example 18.61 

Khokhlov and Kolchin |641 165j studied a model closely related to the rank 
1 case of G^{n,K): each vertex has an activity Oj, and edges are added 
one by one, with the endpoints of the edge chosen independently, and the 
probability that a vertex is chosen proportional to its activity. They proved 
results about the distribution of the numbers of short cycles in this model 
corresponding to the last part of Theorem 117. 11 

Proposition 17.2. (i) // \\Tf^\\ < 1, then qa;(k) < 1/2 for every k. 
(ii) // ||Tk|| > 1, then ak{K) — > oo as k ^ oo. 

Consequently, if k is a graphical kernel on a vertex space V, then {n, k) 
has a giant component if and only if sup afc('^) = c>o. 

Proof. The first statement is immediate, as (1, T^l) < For the second 

statement, we argue as in the proof of Lemma 15.161 there is a bounded 
kernel kat < k with llT^j^H > 1, and T^j^ has a bounded eigenfunction ip 
with eigenvalue A > 1. Taking ||V'||oo = 1 we have (l,r^^l) > (l,T^^j^l) > 
(IjTj^^tp) = X^{l,ip) oo. The final statement follows by Theorem Ivj.lf i'l. 
which states that G^(n, k) has a giant component if and only if HTkH > 1. □ 

Similarly, at least when fj < oo, we have the following consequence of 
well-known properties of Hilbert-Schmidt operators (cf. Lemma l5.15|) . 

Proposition 17.3. // // < oo, then is compact and self-adjoint, and 
if Xi are its (real) eigenvalues (counted with multiplicities), then 

Pk{^) = ^ Tr(r,^) = ^ E < ^ ^ 2. 

i 

Example 17.4. As in Example 14.61 let S = (0, 1] (regarded as a circle) 
with jj, Lebesgue measure, and K{x,y) = h[x — y), with /i > an even 
periodic function that is integrable over (0, 1] . Then is the convolution 
operator f ^ h * f with eigenvalues h{j) = Jq e~'^'^^^^h{x) dx, j S Z, so 

Considering functions h with small support, we can obtain arbitrarily 
large Pk with k bounded and jj k, = 1. Alternatively, we can take h{j) = 
l/ln(2 + |j|), for example; this defines an integrable function h > which 
is continuous except at Theorems V.(1.5) and V.(1.8)], and thus a 
kernel k with \\Tk\\ = J^hKoo but PkicK) = oo for every k > 2 and c > 0. 

Example 17.5. Let K{x,y) = c/{xV y) on 5 = (0, 1] as in Subsection 116.11 
Then (3k{i^) = for every k and every c > 0; indeed, if, say, Xj = i/n, 
the expected number of /c-cycles with vertices in {2~^~^n,2~^n) tends to 
a positive constant independent of m > 0, and thus EQ^ — > oo. 

The same holds for K{x,y) = c(l/(x V y) — 1) as in 1)16. 7(1 and the 5 = 
case of (Un^Il- 
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Example 17.6. Let k{x, y) = (j){x\/y) with S = (0, 1], as in Subsection lKi.fiL 
and assume that </> > is non-increasing with x(f){x) bounded. Assume that 
X]^, . . . , Xfi are i.i. d. and uniformly distributed on (0,1]. (It can be checked 
that the same conclusions hold for Xi = i/n.) Then, results of Aleksandrov, 
Janson, Peller and Rochberg ^6, Theorems 4.1 and 4.6] imply that is a 
bounded positive operator, and it is compact with eigenvalues Aj satisfying 
< oo (which means that belongs to the Schatten-von Neumann 

class Sk) if and only if [x(l){x))'' / x dx < oo. 

By Theorem ll7.lL we have EQ^ — > [3k{n) < oo for > 3. Proposition ll7.3l 
assumes J J k? < oo, but it can be shown (using truncations of (j)) that the 
result extends to the present situation; hence Pki^-) < cxd if and only if 
Jq [x(l){x)) /xdx <oo. 

Consequently, we may for any given £ choose (j) such that /3fc = oo for 
3 < A; < ^ but /3fc < oo for A; > ^. 

Under suitable conditions, the expected total number of cycles converges 
to X]^3/3fc; we omit the details. By Proposition 117.31 this sum is given by 
the following formula. 

Corollary 17.7. // jj <oo, then 

/E,,(-^M1 - A^) - \X^ - \>^) < oo, ^f \\T4 < 1, 
\ oo, if \\T^\\ > 1. 



k=3 

The sum on the right-hand side can be written as — ^ ln(det3(I — T^)), 
where dets is a renormalized Fredholm determinant |S7[ §9]. 

Remark 17.8. Turova [HII studies the number of cycles in the random 
graph discussed in Subsection 116.51 including a formula for limEQfc = (3k- 
She conjectures that the threshold for the existence of a giant component 
is the same as the threshold for Ylkl^k = oo. (This conjecture inspired 
the present section.) We now see from Corollary 117.71 that this is true 
in great generality; for example, if k is bounded, then Theorem 117.11 and 
Proposition 117. 3l implv that the threshold cq = Hr^H"^ in Corollarv 13 . 21 mav 
be written as 

CO 

Co = sup|c : ^^/?fc(cK;) < oo|. (17.5) 

A:=3 

Note, however, that exactly at the threshold, i.e., for G(n, k) with \\T^\\ = 1, 
there is no giant component although Yli'k'='i l^k{i^) = oo. Moreover, the 
relation p7.5)) may fail for unbounded k, see the examples above. In Tur- 
ova's case p6.'21j) . the relation (|17.5j) holds for 5 > (when is Hilbert- 
Schmidt), but not for (5 = (when is not compact), see Example 117.51 

18. Further remarks 



Random graphs defined via kernels appear in various other contexts. One 
natural example is the 'dense' case: let k be a symmetric function from 
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[0, 1]2 to [0, 1] with some suitable 'smoothness' property, and form a graph 
on [n] by taking the probabihty pij of the edge ij to be K{xi,Xj), where 
Xi is the type of vertex i (e.g., Xi = i/n), and different edges are present 
independently. Thus, when k = p is constant, one recovers the dense Erdos- 
Renyi graph G{n,p). The study of this inhomogeneous dense model is as 
far from the concerns of the present paper as the study of G{n, 1/2), say, is 
from the study of G{n, c/n). 

Another case, dense but not so dense, is obtained from our model if we 
omit the restriction that k E L^{S x S,fi x fi). One particular example 
that might have interesting behaviour is K,{x,y) = l/\x — y\, with Xi = i/n, 
say, so Pij = l/\i — j\, for i ^ j. A similar model (in the rank 1 case of 
Subsection 116.4(1 has been studied by Norros and Reittu jS2]- Newman and 
Schulman [73j studied percolation in a closely related infinite random graph: 
two vertices i,j £ Z are joined with probability 1 — exp(— /3|i — where 
P and s are parameters of the model. 

Models with intermediate density (a number of edges that is more than 
linear but less than quadratic in the number n of vertices) could be ob- 
tained by defining the edge probabilities pij in terms of a kernel k but 
with different scaling to that in l|2.3|) . For example, we could take pij = 
mm{K{xi,Xj)/n", 1}, where < a < 1 is a fixed number, or pij = 
say. Although these definitions bear a formal resemblance to the one we 
have used, they lead to very different models. Nevertheless, these models 
may also repay close attention. In some cases such models might correspond 
to, or resemble, graphs growing in time by the addition of vertices, with the 
addition of an increasing number of edges at each step: the case pij = 
for i < j is one particular example. 

A different connection between graphs and symmetric functions W from 
[0,1]^ to [0,1] arises in the work of Lovasz and Szegedy where the 
limit of a sequence of dense graphs is defined by considering the number of 
subgraphs isomorphic to each fixed graph. 

Another natural model is the following: take the type space as [0,1]^, 
say, with the Lebesgue measure, and take the types of the vertices to be 
independent. (Or, more naturally, generate the vertex types by a Poisson 
process of intensity n, so the total number of vertices is random.) Join two 
vertices with a probability p{d, n) that is a function of d, the Euclidean dis- 
tance between the (types of the) vertices, and n. Since the typical distances 
are order n~^/^, the natural normalization is p = f{dn^^'^), for example, 
p{d,n) = ci exp(— C2d^n). 

In many ways, a model defined in this way is similar to that considered 
in this paper: if / decays sufficiently fast, the expected degrees are of order 
1, and the degree distribution will be asymptotically Poisson. However, in 
other ways this graph is very different from the ones we have been studying: 
in particular, it has many small cycles. Determining the threshold for the 
emergence of the giant component in this model is likely to be as hard as 
finding the critical probability for a planar percolation model (indeed, it is 
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essentially the same task), and is thus likely to be impossible except perhaps 
in very special cases. 



Another interesting property of a graph is the behaviour of the contact 
process on the graph. Suppose that each vertex is either susceptible, or 
infected: infected vertices infect their susceptible neighbours with rate A, 
and recover with rate 1, returning to the susceptible state. The process 
starts with a single randomly chosen infected vertex. When the average 
degree is of order 1, one might expect that there is a critical value Ac such 
that for A < Ac constant, the expected number of vertices ever infected is 
0(1), while for A > Ac constant, with probability bounded away from zero 
almost all (perhaps even all) vertices in the giant component become infected 
at some point, and the infection lasts an exponentially long time. This is 
the case for the 4-regular grid graph on the torus, say; see Liggett |H5] . 

The behaviour of the contact process on the scale-free LCD graph has 
been studied by Berger, Borgs, Chayes and Saberi |T2], who gave detailed 
results showing in particular that there is no threshold (i.e., Ac = 0). 

For G'^(n, k), one might expect a positive threshold if and only if \\Tf^\\ < 
oo. (Perhaps an additional condition would be needed, such as k bounded.) 
In fact, one can make a more detailed prediction based on the contact process 
on infinite trees: the threshold Ac should be the same as the threshold for 
the process to continue forever on an infinite tree generated by the branching 
process Xk(x). Note that it is likely that there are two distinct thresholds 
for the behaviour of the contact process on such trees (this is known only 
for certain classes of trees, including regular trees; see [2311111112]): a lower 
threshold Ai above which the process has positive probability of never dying 
out, and an upper threshold A2 above which a given vertex has a positive 
probability of becoming reinfected infinitely often. (In both cases, we start 
with a single infected vertex.) In the graph, Ai should be relevant: if the 
process survives with drift in the infinite tree, it will eventually revisit a 
given vertex of 6*^(71, k), as the neighbourhoods of a vertex are only locally 
treelike. 

Related results have been proved by Durrett and Jung 45^ for a d- 
dimensional version of the small-world model of Bollobas and Chung: the 
vertex set is a discrete torus, each vertex is connected to all vertices within 
a fixed distance, and then all pairs in a random matching of the vertices are 
added as 'long-range' edges. Durrett and Jung prove separation of Ai and 
A2 for an infinite version of this graph. Also, they show that for A > Ai, 
a modified contact process on the finite graph survives for an exponential 
time; the modification is to allow an infected vertex to infect a randomly 
chosen other vertex, at an arbitrarily small but positive rate 7. The result 
is likely to hold with 7 = 0, since the 'long-range' edges already provide 
sufficient global randomness. 
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If the definition of the contact process on a graph G is modified so that 
when a vertex recovers it cannot be reinfected, one might expect that, start- 
ing with a single infected vertex f , the set of vertices eventuahy infected 
have a 'similar' distribution to the component of G[p\ containing v, where 
G[p\ is formed from G by keeping each edge independently with probability 
p = A/(l + A). Roughly speaking, for each edge ww' of G, we may declare 
the edge ww' to be open if whichever of w and w' is first infected will try 
to infect the other before it recovers, an event with probability A/(l + A). 
The set of infected vertices is the component of v in the graph formed by 
the open edges. Unfortunately, since the probabilities of infection from w 
to w' and from w to w" both depend on the random time that w remains 
infected, the events that different edges are open are not independent. This 
fact is missed by Newman [SU], who states that this modified contact pro- 
cess is equivalent to percolation on G; we should like to thank an anonymous 
referee for drawing this paper to our attention. Nevertheless, it may still 
be true that the threshold in this modified contact process is close to the 
percolation threshold on G, at least under certain conditions. 

Appendix A. Probabilistic lemmas 

In this appendix we prove three simple technical results concerning se- 
quences of random variables. The first and third are used in the main body 
of the paper; the second is needed to prove the third. The first concerns 
random Borel measures. 

Let 5 be a separable metric space, let M{S) be the space of all finite 
(positive) Borel measures on S, and let P{S) be the subspace of all Borel 
probability measures on S. We equip M{S) and P{S) with the usual (weak) 
topology: iJ-n ^ fJ- and only if / / d^n j f dfi for every function / in 
the space Cb{S) of bounded continuous functions on S. Alternatively, as is 
well known, if and only if fini^) /^(^) for every //-continuity set 

A, i.e., every measurable set A with n{dA) = 0. 

Remark A.l. The case of probability measures is perhaps better known, 
and is treated in detail in, for example, Billingsley [^. Many results extend 
immediately to M(5), either by inspecting the proof, or because ^ // 
in M{S) if and only if fin{<S) — > fJ-iS) and either n{S) = or fin/tJ"n{<S) 
fi/f,{S) in P{S). 

The spaces P{S) and M{S) are themselves separable metric spaces; for 
P{S), see [iSl Appendix III]. 

The characterizations above of convergence in P{S) and M{S) extend to 
random measures and convergence in probability as follows; see Kallenberg 
|6H Theorem 16.16] for a similar (but stronger) theorem under a stronger 



hypothesis on S. Note that both (ii) and (iii) are special cases of (iv 



Lemma A. 2. Let S be a separable metric space, and suppose that Vn, n > 1, 
are random measures in M{S). Then the following assertions are equivalent: 
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[1 U„ 



P 



(ii) I'niA) fJ-iA) for every fj,- continuity set A; 

(iii) J f dvn ^ j f dp, for every hounded continuous function f : S ^ M.. 

(iv) J f dun ^ f f dp for every bounded ji-a.e. continuous function f : 
S ^R. 



Proof, (i) =^ (ii) If A is a /i-continuity set then u i— > I'iA) defines a mea- 
surable function M{S) M which is continuous at /x. 



(ii) 



(iv) We may suppose that < / < 1; the general case follows 



by linearity. Let N be the ;U-null set consisting of points at which / is 
discontinuous, and let At := f~^{t, oo) = {x : f{x) > t}. li x £ At \ At and 
X ^ N, then f{x) = t by continuity. Thus dAt C NU f~^{t}. Hence, the sets 
dAt \ N are disjoint, and iJ,{dAt) = except for at most countably many t. 
When fJ.{dAt) = 0, we have fn(^t) K'^t) by 



soE\un{At)-^x{At 







by dominated convergence. By dominated convergence again, 



E 



fdUn 



fdfi 



E 



UaiAt)dt 



fi{At) dt 



<E / UniAt) - f^{At) dt 



0. 



EK{At)- fi{At)\dt 



IV 



ni, 



(iii) Trivial. 



(i) The topological space M{S) is metrizable, but the topology 
is also defined by the functionals /i i— > J f dfi, f S Cb{S). Hence, if [/ is a 
neighbourhood of /j, in M{S), there is a finite set of functions /i, . . . , /at G 
Cb{S) and e > such that if |/ fidu — J fidfi\ < e, i = 1,...,N, then 
u £ U. Consequently, 



n^n^U)<Y,F( j fidv 

i=l ^ •' 



fidfi 



> £ 



□ 



Remark A. 3. To verify condition (ii) it often suffices to consider A in a 
suitably selected family of subsets. For example, it is well-known that on M, 
it suffices to consider /^-continuity sets of the form (— oo, a;] and, for M(5), 
M itself; see |E1 Section 3]. 



Recall that if Xn is a sequence of random variables and a sequence of 
positive real numbers, then = 0{an) whp means that there is a constant 
C such that < Can whp. Our final technical result (Lemma lA.5l below) 
is simple, but perhaps a little surprising: we shall show that under suitable 
assumptions, if X„ = 0(a„) holds conditionally (after conditioning on the 
sequences x„ in our model), then it holds unconditionally, i.e., that the 
implicit constant may be assumed to be deterministic. We start with a 
preparatory lemma. 
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Lemma A. 4. Let Ai,A2,---, be non-empty families of random variables 
such that for any sequence Xn G An we have Xn = 0{an) whp. Then there 
is a constant C such that supj^g^^^ 1^(1^1 > Can) ^0 as n ^ oo. In other 
words, the implicit constant in Xn = 0{an) whp can be chosen uniformly 
for Xn£A n • 

Proof. Replacing Xn by Xn/an, we may assume that a„ = 1. Suppose the 
conclusion fails. Then, for every m there is an > such that there are 
arbitrarily large n for which there is an Xn G An with P(|X„| > m) > 
Em- Let (mfc) be a sequence of integers where each positive integer appears 
infinitely many times. Select inductively an increasing sequence (n^) and 
Xn^ € An^, such that P(|X„j.| > mfc) > e^fc- For n ^ {refc}, choose Xn from 
An arbitrarily. 

For any positive integer m, there are infinitely many k such that mt = m, 
and thus infinitely many n such that P(|X„| > m) > Em- Hence (Xn) is not 
0(1) whp, which contradicts our assumption. □ 

The next lemma can be stated in terms of families of probability 

distributions (on M) and mixtures E//„(l^) of them, but we prefer a state- 
ment in terms of random variables Xn{y) ~ IJ-niv)] we consider a sequence of 
families Xn{y) of random variables defined for y in a subset Mn of a certain 
space M as this is convenient when we apply the lemma to G^{n, k). 

Lemma A. 5. Let M be a metric space, and, for each n > 1, let Xn{y), 
y € Mn ^ -A^; be a (measurable) family of real-valued random variables. 
Let yQ £ Ai, and suppose that for every sequence (yn) with yn G Mn and 
Un — *■ Ho we have Xn{yn) = 0(a„) whp. Then, if (y„) is a sequence of Un- 
valued random variables, independent of all Xn{y), with Yn ^ yo, we have 
XniYn) = 0(a„) whp. 

Proof. Since Yn yo, there is a sequence 5„ ^ such that F{d{Yn,yo) < 
6n) ^ 1. Set Un = {y £ Mn : d{y,yo) < 6n}, so P(y„ e C/„) ^ 1. 
Note that y„ G C/„ implies y„ yo, and thus Xn{yn) = 0(a„) whp. Let 
An = {Xn{y) ■ y G Un}. By Lemma fA.4l there exists C such that En ■= 
supy^u„H\Xniy)\ > Can) ^ 0. Finally, P(|X„(y„)| > Can) < En + r(Yn i 

Un) ^0. □ 
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